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PREFACE 


The original version of this book submitted to the David A. 
Wells Prize Committee of Harvard University in 1941 carried the 
subtitle, “The Operational Significance of Economic Theory.” At 
that time most of the material presented was already several years 
old, having been conceived and written primarily in 1937. Further 
delay in publication has been necessary because of the war, and 
because of the addition of supplementary treatise-like material 
going beyond the original conception of the work as indicated by 
its subtitle. 

Because of the pressure of war work I have not been able to 
do full justice to the literature of the last few years, nor even to 
include all of the developments of my own thinking. Fortunately, 
the passage of time has dealt kindly with the analysis contained 
here, and where it abuts upon the topics treated in Professor 
Hicks’s masterly Value and Capital, the similarity in point of view 
has been reassuring. 

My greatest debt is to Marion Crawford Samuelson whose con- 
tributions have been all too many. The result has been a vast 
mathematical, economic, and stylistic improvement. Without her 
collaboration the book would literally not have been written, and 
no perfunctory uxorial acknowledgment can do justice to her aid. 
Nor can the quaint modern custom of excluding the value of a wife’s 
services from the national income condone her exclusion from the 
title page. 

My thanks for prolonged stimulation over many years must go 
out to Professors Schumpeter, Leontief, and E. B. Wilson, while 
each of a legion of Harvard graduate students has left his mark 
upon what follows. The reader will note my dependence upon the 
sterling contribution to Welfare Economics of Professor Abram 
Bergson. Grateful acknowledgment is made to the Social Science 
Research Council and to the Society of Fellows of Harvard Uni- 
versity for the opportunities they provided for pursuit of inde- 
pendent research, and to the Department of Economics of Harvard 
University for their courteous acceptance of the wartime delays 
in publication. 
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PART I 



CHAPTER I 


INTRODUCTION 

The existence of analogies between central features of various theories 
implies the existence of a general theory which underlies the partiadar 
theories and unifies them with respect to those central features. This 
fundamental principle of generalization by abstraction was enunci- 
ated by the eminent American mathematician E. H. Moore more 
than thirty years ago. It is the purpose of the pages that follow 
to work out its implications for theoretical and applied economics. 

An economist of very keen intuition would perhaps have sus- 
pected from the beginning that seemingly diverse fields — produc- 
tion economics, consumer’s behavior, international trade, public 
finance, business cycles, income analysis — possess striking formal 
similarities, and that economy of effort would result from analyzing 
these common elements. 

I can make no claim to such initial insight. Only after labori- 
ous work in each of these fields did the realization dawn upon me 
that essentially the same inequalities and theorems appeared again 
and again, and that I was simply proving the same theorems a 
wasteful number of times. 

I was aware, of course, that each field involved interdependent 
unknowns determined by presumably efficacious, independent equi- 
librium conditions — a fact which has always been generally real- 
ized. But, and this leads me to the second fundamental purpose 
of this work, it had not been pointed out to my knowledge that 
there exist formally identical meaningful theorems in these fields, 
each derived by an essentially analogous method. 

This is not surprising since only the smallest fraction of eco- 
nomic writings, theoretical and applied, has been concerned with 
the derivation of operationally meaningful theorems. In part at 
least this has been the result of the bad methodological precon- 
ceptions that economic laws deduced from a priori assumptions 
possessed rigor and validity independently of any empirical human 
behavior. But only a very few economists have gone so far as 
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The majority would have been glad to enunciate meaningful 
theorems if any had occurred to them In fact, the literature 
abounds with false generalization 

Wt do not have to dig deep to find examples Literally 
hundreds of learned papers have been written on the subject of 
utility Take a little bad psychology, add a dash of bad philosophy 
and ethics, and liberal quantities of bad logic, and any economist 
can proxe that the demand curve for a commodity is negattvely 
inclined His instinct is good , the attempt to derive a meaningful 
useful theorem is commendable — much more so than the innocuous 
position that utility is always maximized because people do what 
they do How refreshing then is a paper like Slutsky’s 1 which 
attempted, with partial success, to deduce once and for all the 
hypotheses upon price quantity budget behavior implied in the 
utility approach 

The economist has consoled himself for his barren results with 
the thought that he was forging tools which would eventually yield 
fruit The promise is always in the future, we are like highly 
trained athletes who never run a race, and in consequence grow 
stale It is still too early to determine whether the innovations 
in thought of the last decade will have stemmed the unmistakable 
signs of decadence which were clearly present in economic thought 
prior to 1930 

By a meaningful theorem I mean simply a hypothesis about 
empirical data which could conceivably be refuted if only under 
ideal conditions A meaningful theorem may be false It may 
be valid but of trivial importance Its validity may be mdetenm 
nate, and practically difficult or impossible to determine Thus 
with existing data, it may be impossible to check upon the hy 
pothesis that the demand for salt is of elasticity — 1 0 But it is 
meaningful because under ideal circumstances an experiment could 
be devised whereby one could hope to refute the hypothesis 

The statement that if demand were inelastic, an increase m 
price would raise total revenue is not a meaningful theorem in this 
sense It implies no hypothesis — certainly not even that a demand 
exists which is inelastic — and is true simply by definition It may 
possibly ha\ e had a certain ‘'psychological” usefulness in helping 
1 F Slutsky Sulla teona del bilancio del consumatore Gtornale digit Economtsh 
’ f 19 IS) 1 26 
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economists ask the right questions of the facts, but even here I 
have some doubts. 

In this study I attempt to show that there do exist meaningful 
theorems in diverse fields of economic affairs. They are not de- 
duced from thin air or from a priori propositions of universal truth 
and vacuous applicability. They proceed almost wholly from two 
types of very general hypotheses. The first is that the conditions 
of equilibrium are equivalent to the maximization (minimization) 
of some magnitude. Part I deals with this phase of the subject in 
a reasonably exhaustive fashion. 

However, when we leave single economic units, the determina- 
tion of unknowns is found to be unrelated to an extremum position. 
In even the simplest business cycle theories there is lacking sym- 
metry in the conditions of equilibrium so that there is no possibility 
of directly reducing the problem to that of a maximum or mini- 
mum. Instead the dynamical properties of the system are speci- 
fied, and the hypothesis is made that the system is in "stable” 
equilibrium or motion. By means of what I have called the Corre- 
spondence Principle between comparative statics and dynamics, 
definite operationally meaningful theorems can be derived from so 
simple a hypothesis. One interested only in fruitful statics must 
study dynamics. 

The empirical validity or fruitfulness of the theorems, of course, 
cannot surpass that of the original hypothesis. Moreover, the 
stability hypothesis has no teleological 2 or normative significance ; 
thus, the stable equilibrium might be at fifty per cent unemploy- 
ment. The plausibility of such a stability hypothesis is suggested 
by the consideration that positions of unstable equilibrium, even 
if they exist, are transient, nonpersistent states, and hence on the 
crudest probability calculation would be observed less frequently 
than stable states. How many times has the reader seen an egg 
standing upon its end? From a formal point of view it is often 
convenient to consider the stability of nonstationary motions. 

In a good deal of Part II the dynamical behavior of systems is 
analyzed for its own sake, regardless of implications for compara- 
tive statics. And in the last chapters of Part I, I have gone beyond 
the original conception of the book to include such subjects as 

2 L. J. Henderson, The Order of Nature (Cambridge, Massachusetts: Harvard 
U niver sity Press, 1917). 
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welfare economics Although the logical content of the theorems 
enunciated here is different there is an underlying unity of method 
In the beginning it \\ as hoped that the discussion could be made 
nontechnical Very quickly it became apparent that such a pro 
cedure while possible would involve a manuscript many times 
the present size Moreover I have come to feel that Marshall s 
dictum that it seems doubtful whether any one spends his time 
well in reading lengthy translations of economic doctrines into 
mathematics that have not been made by himself should be 
exactly reversed The laborious literary working o\cr of essen 
tially simple mathematical concepts such as is characteristic of 
much of modern economic theory is not only unrewarding from 
the standpoint of advancing the science but invoK es as well mental 
gymnastics of a peculiarly depraved type 

On the other hand I have attempted to avoid all mathematical 
flourish and the pure mathematician will recognize ill too readily 
the essentially elementary character of the tools used My own 
interest in mathematics has been secondary ind subsequent to my 
interest in economics Nevertheless the reader may find some 
parts hard going To lighten the way I have placed the purely 
mathematical theorems in two separate appendices of which the 
second gives a reasonably self contained introduction to the theory 
of difference equations 



CHAPTER II 


EQUILIBRIUM SYSTEMS AND COMPARATIVE 

STATICS 

Most economic treatises are concerned with either the descrip- 
tion of some part of the world of reality or with the elaboration of 
particular elements abstracted from reality. Implicit in such 
analyses there are certain recognizable formal uniformities, which 
are indeed characteristic of all scientific method. It is proposed 
here to investigate these common features in the hope of demon- 
strating how it is possible to deduce general principles which can 
serve to unify large sectors of present day economic theory. 

In every problem of economic theory certain variables (quan- 
tities, prices, etc.) are designated as unknowns, in whose determina- 
tion we are interested. Their values emerge as a solution of a 
specified set of relationships imposed upon the unknowns by as- 
sumption or hypothesis. These functional relationships hold as of 
a given environment and milieu. Of course, to designate this 
environment completely would require specification of the whole 
universe; therefore, we assume implicitly a matrix of conditions 
within which our analysis is to take place. 

It is hardly enough, however, to show that under certain condi- 
tions we can name enough relations (equations) to determine the 
values of our unknowns. It is important that our analysis be 
developed in such terms that we are aided in determining how our 
variables change qualitatively or quantitatively with changes in 
explicit data. Thus, we introduce explicitly into our system cer- 
tain data in the form of parameters, which in changing cause shifts 
in our functional relations. The usefulness of our theory emerges 
from the fact that by our analysis we are often able to determine 
the nature of the changes in our unknown variables resulting from 
a designated change in one or more parameters. In fact, our 
theory is meaningless in the operational sense unless it does imply 
some restrictions upon empirically observable quantities, by which 
it could conceivably be refuted. 
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This in brief is the method of comparative statics meaning by 
this the investigation of changes m a system from one position of 
equilibrium to another without regard to the transitional process 
involved in the adjustment By equilibrium is meant here only 
the values of variables determined by a set of conditions and no 
normative connotation attaches to the term As will be shown 
later it is always possible to set up completely trivial equilibrium 
systems 

This method of comparative statics is but one special apphea 
tion of the more general practice of scientific deduction in which 
the behavior of a system (possibly through time) is defined in 
terms of a given set of functional equations and initial conditions 
Thus, a good deal of theoretical physics consists of the assumption 
of second order differential equations sufficient in number to de 
termine the evolution through time of all variables subject to given 
initial conditions of position and velocity Similarly in the field 
of economics dynamic systems involving the relationship between 
variables at different points of time (e g time derivatives weighted 
integrals lag variables functionals etc ) have been suggested for 
the purpose of determining the evolution of a set of economic van 
ables through time 1 At a later stage 1 deal with these dynamic 
problems 

The concept of an equilibrium system outlined above is ap 
phcable as well to the case of a single variable as to so called 
general equilibrium involving thousands of variables Logically 
the determination of output of a given firm under pure competition 
is precisely the same as the simultaneous determination of thous 
ands of prices and quantities In every case ceteris panbus as 
sumptions must be made The only difference lies in the fact that 
in the general equilibrium analysis of, let us say, Walras the con 
tent of the historical discipline of theoretical economics is prac 
♦analbj ‘trhaiiated. Th/t thiiwgt which, ace. taiejj. as. data- Ore that 
system happen to be matters which economists have traditionally 
chosen not to consider as within their province Among these 
data may be mentioned tastes technology the governmental and 
institutional framework and many others 

1 R Frisch On the Notion of Equil br um and Disequ librium Review o f Economic 
111(1936) 100-105 J Tinbergen Annual Survey Suggestions on Quantitative 
ness Cycle Theory Econometnca III (1935) 241 308 
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It is clear, however, that logically there is nothing fundamental 
about the traditional boundaries of economic science. In fact, a 
system may be as broad or as narrow as we please depending upon 
the purpose at hand ; and the data of one system may be the vari- 
ables of a wider system depending upon expediency. The fruit- 
fulness of any theory will hinge upon the degree to which factors 
relevant to the particular investigation at hand are brought into 
sharp focus. And if, for the understanding of the business cycle 
a theory of governmental policy is demanded, the economist can 
ill afford to neglect this need on the ground that such matters lie 
outside his province. As for those who argue that special degrees 
of certainty and empirical validity attach to the relations encom- 
passed within the traditional limits of economic theory, we may 
leave to them the task of proving their case. 

It is not to be thought that the content of systems as described 
above must be restricted to the variables usually considered in 
price and value theory. On the contrary, one employs such con- 
structions throughout the whole field of theoretical economics in- 
cluding monetary and business cycle theory, international trade, 
etc. It goes without saying that the existence of such systems in 
no way hinges upon the employment of symbolic or mathematical 
methods. In fact, any sector of economic theory which cannot be 
cast into the mold of such a system must be regarded with suspicion 
as suffering from haziness. 

Within the framework of any system the relationships between 
our variables are strictly those of mutual interdependence. It is 
sterile and misleading to speak of one variable as causing or de- 
termining another. Once the conditions of equilibrium are im- 
posed, all variables are simultaneously determined. Indeed, from 
the standpoint of comparative statics equilibrium is not something 
which is attained; it is something which, if attained, displays 
certain properties. 

The only sense in which the use of the term causation is ad- 
missible is in respect to changes in external data or parameters. 
As a figure of speech, it may be said that changes in these cause 
changes in the variables of our system. An increase in demand, 
i.e., a shift in the demand function due to a change in the data, 
tastes, may be said to cause an increased output to be sold. Even 
here, when several parameters change simultaneously, it is im- 



10 FOUND AT loss Ob ECONOMIC ANALYSIS 

possible to speak of c msation attributable to each except m respect 
to limiting rates of change (partial derivatives). 

Symbolic Formulation 

All of the above may be stated compactly m mathematical 
form Given it \ ambles or unknow ns (xi, , *„) and m, greater 
or less than n, parameters (at , a„), vve assume n independent 
and consistent functional relationships involving our variables and 
parameters These may be written most generally in implicit 
form, each equation involving all variables and parameters 


/■<*.. 

, X n ttj, 


« 0, 

/»(*• 

. T- ai, 

,a») 

- 0, 


,Xn «l, 

. <**) 

- o, 


or more compactl> , 

/*(ri» , x n ezi a m ) = 0 (t = 1, , n) 

Our equations must not be greater than n in number, for if thc> 
were, they could not be both consistent and independent, if less, 
our system w ill, in general, be under-determined If our equations 
possess certain characteristics, to be discussed later, they may be 
considered to determine a unique set of values of our unknowns 
(*i°, , *»°) corresponding to any preassigned set of parameters 

, a* 0 ) 

This functional relationship ma> be expressed mathematically 
as follow s 

X{ = £*(«!, ,a„) (* = 1 , ,»*) ( 2 ) 

It should be understood that this does not imply that we can 
express our unknown variables as any elementary functions of the 
parameters (such as polynomial, trigonometric, or logarithmic 
functions) On the contrary, our conditions of equilibrium in set 
(1) will not m general be expressible in terms of a finite number of 
elementary functions, and even if they were, we should not be sure 
that they could be solved explicitly in any simple terms How- 
ever, this is of no particular importance for the elementary func- 
are only special forms which have been of historical interest 
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in the development of mathematical thought and its physical ap- 
plications. If one were to draw free hand either at random or as a 
result of a complete set of observations a demand curve relating 
price and quantity, this could not in general be more than approxi- 
mately represented by finite combinations of elementary functions. 
And yet it is a perfectly good functional relationship indicating a 
certain correspondence between these variables. Furthermore, if 
there were any practical need for so doing, it could be tabulated 
once and for all, be named, and ever afterwards be accepted as a 
member of the family of respectable functions. 

It is precisely because theoretical economics does not confine 
itself to specific narrow types of functions that it is able to achieve 
wide generality in its initial formulation. Still it is not to be for- 
gotten that the aim of fruitful inference is the explanation of a wide 
range of phenomena in terms of simple, restrictive hypotheses. 
However, this must be the final result of our research, and there is 
no point in crippling oneself in setting out upon the journey. 

If one were omniscient, i.e., if all the implications of any as- 
sumptions were intuitively obvious, the equations (2) would be 
instantly known as soon as the set (1) were given. Short of such 
powers, they could only be solved for to any required degree of 
approximation with much effort; but, of course, it could be done, 
given the requisite time and patience. 

Considering the amount of labor involved, one is tempted to 
question the advantage of starting from our equilibrium equations 
in (1). Why not begin directly with the equations in (2)? Indeed, 
it may be pointed out that these resulting functions between un- 
knowns and parameters could have arisen from an infinity of 
possible alternative sets of original equations. In particular, con- 
sider the set of implicit equations 

g l (ai, • • •, <Xm) — x, = 0. {i = 1 , • • *, n) (3) 

This can be solved by inspection to give the equations of (2), but 
of course the solution is trivial. It is trivial in the sense that the 
result is intuitively obvious from the beginning, and not for the 
reason that (2) says the same thing as (3). For, after all, this 
equivalence is also true for (1) and (2), but their identity is not 
trivial in this psychological sense. 

It is important not to be confused in these matters, for they lie 
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at the foundations of scientific deduction, and they have been mis 
understood particularly often by economists By deductive rea 
somng we are enabled only to reveal to ourselves implications 
already included in our assumptions We may bnng to explicit at 
tention certain formulations of our original assumptions which ad- 
mit of possible refutation (confirmation) by empirical observation 
This process may best be regarded as a translation of our ongi 
nal hypothesis into a different language, but in so translating — 
provided, of course, that no error in logic has crept in — we do not 
change the nature of our original hypothesis, neither adding nor 
subtracting from its validity and precision 

The usefulness of the formulation of equilibrium conditions 
from which emerges our solution lies in the fact that by so doing 
we often gain knowledge concerning possible and necessary re- 
sponses of our variables to changes in data Without such restric- 
tions our theories would be meaningless Merely to state, as was 
suggested earlier, that there exists a final functional relationship 
between all variables and parameters (as of an infinity of accom- 
panying circumstances) is bare and formal, containing no hy- 
pothesis upon the empirical data 

It is because in a wide variety of cases ive can more or less 
plausibly assume or hypothesize certain properties of our equi- 
librium equations that we are able to deduce with the same degree 
of plausibility certain properties of the resulting functions between 
our unknowns and parameters For the properties of the functions 
in (2) are necessarily related to the structural characteristics of the 
equilibrium set (1) The properties ordinarily discussed in this 
connection are not specific quantitative restrictions upon the func 
tions (as being polynomials, etc ), but rather consist of assumptions 
with respect to slope, curvature, monotomcity, etc , they are the 
kind implied by the law oj diminishing returns 


Displacement of Equilidrium 
It is easy to show mathematically how the rates of change of 
our unknown 0 with respect to any parameter, say <*i, may be com 
puted from our equilibrium equations As a matter of notation let 


dx? __ dg'(ofi°, , g-°) 
don ~ dai 


,««•) 
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stand for the rate of change of the ith variable with respect to the 
parameter a u all other parameters being held constant. It is 
necessary because of the ambiguity of the conventional notation 
of partial differentiation to make sure which variables are being 
held constant. 5 

Such partial derivatives must be evaluated as of a given value 
of the set of parameters, and hence as of a corresponding set of 
values of our dependent variables. Consider the initial position 

(«i°, 

and the corresponding set of unknowns 

, . (*i°> 

where of course 

■■■, x n °, ftl °, • • • , a m °) = 0, (* = 1, • • - , ») (4) 

and 

Xi° = £''(<, • • • , a m °), (i = 1, • • • , n ) (5) 

since our unknowns must satisfy the equilibrium conditions. 
Differentiating each equation of (1) with respect to ai, remember- 
ing that all other parameters are to be held constant, but that all 
our unknowns are variable, we get 2 



with all other variables and parameters held constant. Similarly, 


r { 3f(x 1°, • • ' , X n °, «i°, • • * , «m°) 
Jai “ daj 

’■ In matrix terms this is = [— fall- 
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r 4 


Be it noted tint the numerical values of these parti d derivatives 
are fully determined at the equilibrium point in question Thus 
vve have n linear equations with constant coefficients in n unknowns 
[(dxt/dai) 0 dx»/daj) 0 !] The solution values of these will de 
pend upon the values of the coefficients thus the partial deriva 
tives relating our dependent variables and parameters are deter 
mined by the structural properties of our equilibrium system 
Since (6) represents linear equations their solution for non 
singular cases may be represented in the familiar detcrminantal 
form 


/Ar \0 £/• A * 

(srO " - <* - 1 "> 


A = I/-. I = 




I/." /„■ 




m 


and A * indicates the cofnctor of the clement of the *th row and the 
£th column Or in matrix terms 


[ a ^J = -c/.w] w 


Illustrative Tax Problem 

For concreteness let us apply our anal} sis to two simple cases 
Consider a firm with a gw cn demand curve relating price and out 
put and a given production cost schedule relating total cost and 
output Suppose in addition that the output of the firm is subject 
to a tax of t dollars per unit The profit of tl e firm may be written 

ir = tota’i revenue — Tt/ru prufrcittaun cuA — \xtaA 
— xp(x) — C(x) — l\ 

where 

xp(x) — total revenue in function of output 
C(x) = lowest total production cost at which each output can 
be produced 
tx = total tax payment 
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It is clear that for any given tax rate, say t°, the firm will decide 
upon some given output to be produced and sold; i.e., 

= g0°). (9) 

where the functional relationship g corresponds to the functions 
in (2). However, we cannot leave the matter in so indefinite a 
state. We wish to know more than that there exists an equilibrium 
output for each tax rate. What is the nature of the dependence 
of our variable upon the tax rate regarded as a parameter? Will 
an increased unit tax result in a larger or smaller output? It is a 
poor theory indeed which will not answer so simple a question. 
Let us see if we can arrive at an answer to this through the formula- 
tion of the conditions of equilibrium. 

It is in general assumed that a firm will select that output which 
will maximize its net revenue. That is to say, our equilibrium 
value for output will emerge as a solution of a simple maximum 
problem. Specifically, it is necessary 3 for a regular maximum of 
profit with respect to x, as of a given tax rate, that 


drrjx, t) _ 
dx 

d 2 Tr(x, t) n 
dx 2 ^ 


( 10 ) 


The first condition merely states that at a maximum the tangential 
slope of the profit function plotted against output must be hori- 
zontal or equal algebraically to zero. The second condition in- 
sures that we do not have a minimum. 

For the problem at hand our condition of equilibrium may be 
derived by simple differentiation to become 


[ xp{x) — C(x)~\ — t — 0, (11) 

where it is assumed for simplicity that the inequality in (10) is 
realized for all values of the variable concerned. Now equation 
(11) corresponds to our equilibrium set (1), which in this case con- 
tains but one equation due to the fact that we have only the value 
of one unknown to determine. 


3 See Mathematical Appendix A, sec. I. 
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For each value of t, there will correspond a root of the resulting 
equation in x and this will be our equilibrium value Thus, the 
function (9) may be regarded as the explicit solution of this implicit 
equation 

Now what have we gained by introducing the maximum prob 
lem for the firm? Does this enable us to answer our original ques 
tion as to the nature of the dependence of the output upon the tax 
rate? Let us apply the general method outlined above to calculate 
the rate of change of equilibrium output with respect to the 
parameter t Differentiating (11) with respect to t we get 


(I) -**« 


In this simple case no resort to determinants is necessary in order 
to achieve a solution for 


fdx\° 1 

- CM] 


But this provides us with the answer for which we have been seek 
ing For as a sufficient condition for a relative maximum we 
know that 


| ? [*W)-CW]<0 


(ty<° 


or g'(*°) < 0 


ivhich is what intuition tells us would happen as a result of such a 
tax Be it noted in passing that it is assumed that the firm is 
always in equilibrium before and after the tax is imposed and that 
the tax affects the equilibrium only as indicated m equation (11) 
In any actual case considerable attention must be devoted to the 
problem of verifying the correctness of these assumptions before 
any practical application is made of the conclusions reached 
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Illustrative Market Case 4 

Let us consider as another example a market for a good or 
service where the price and quantity are determined by the inter- 
section of hypothetical supply and demand curves. Furthermore, 
let us introduce a parameter of shift, a, into our demand curve 
(e.g., tastes, tax, rival price, etc.). Here we have two variables, 
one parameter, and two equations to define equilibrium values of 
our variables in terms of the parameter. Mathematically, 

D(x, a) — p = 0, 

S(x) — p = 0. 

As a solution we have 

x° = g^a 0 ), 

po = g 2( a °). 

How then will our variables change with changes in a assuming 
that an increase in a shifts the demand curve upwards and to the 
right? As before, we differentiate our equilibrium relations with 
respect to the parameter to get two linear equations, 


(16) 

(17) 


C W 
dx 

S’ 


(dx\° _ (dp\ u 
\ da ) \ da ) 

(dx\° _ fdp\° 
\ da) \da) 


dD 

da 


= 0 . 


By simple substitution we get 



(18) 


(19) 


Now we know that dD/da > 0 by definition of our parameter of 
shift. Thus, (dx/da)° ^0 depending upon whether S' 5 dD/ dx. 
At first sight this seems merely to put off the dilemma, to replace 
one equation by another. But if we examine the kind of market 


* This is treated in greater detail in chap, lx below. 



1 8 FOUNDATIONS 01 ECONOMIC ANAI \ SIS 


under consideration, it will appear that the mere fact that the 
market is in stable equilibrium in the initial situation will remove 
all ambiguity For if the market is the familiar Marshallian con 
sumer s good market stability of equilibrium by definition requires 
that the supply curve cut the demand curve from below (even in 
the case of decreasing cost due to external economies) * 

Thus, 


Therefore, 



( 20 ) 

(fi)’>° 

( 21 ) 


However, the algebraic sign of the price change will hinge upon 
whether there is a positively or negatively inclined supply curve 
For 

Thus, since ( dx/da)° is positive from (21) (dp /da) 0 and S' must be 
of the same sign, or 

(g)V>° ( 23 ) 

It is impossible in the price case to get rid of the final ambiguity 
Suppose, however, that this were a market for a factor of pro 
duction Here the conditions for stable equilibrium are familiarly 
defined to be either a positively sloping supply curve with a nega- 
tively sloping demand curve, or if the supply curve is negatively 
inclined, it must be backward rising and steeper than the demand 
curve 6 Mathematically, our stability conditions may be expressed 


dx * 


<t> 




hor this case, therefore, the sign of the price change is known, while 
the quantity change is ambiguous depending upon the algebraic 

6 A Marshall Pnttctplei of Economics (8th edition) p 346 n 1 p 806 n I 
•J R Hicks Value and Capital (Oxford 1939) chap v 
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sign of the slope of the supply curve. In brief, 



Innumerable other examples could be cited. In general, we 
should not expect to be able to determine the signs of the rates of 
change of our variables upon the basis of simple a priori qualitative 
restrictions on our equilibrium equations. This is not because of 
the difficulty and complexity of solving a large number of equa- 
tions; for these could be solved if sufficient were known about the 
particular empirical values of our conditions of equilibrium. It is 
rather that the restrictions imposed by our hypotheses on our 
equilibrium equations (stability and maximum conditions, etc.) 
are not always sufficient to indicate definite restrictions as to alge- 
braic sign of the rates of change of our variables with respect to 
any parameter. 

Only imagine a change in a parameter which enters into all of 
a large number of equilibrium equations causing them simultane- 
ously to shift. The resulting net effect upon our variables could 
only be calculated as a result of balancing the separate effects 
(regarded as limiting rates of change), and for this purpose detailed 
quantitative values for all the coefficients involved would have 
to be known. 

Summary 

Before going on in the next section of our work to indicate how 
the economist is enabled to deduce significant results in a wide 
category of cases, it will be well to summarize the thread of the 
argument thus far. 

a. For theoretical purposes an economic system consists of a 
designated set of unknowns which are constrained as a condition 
of equilibrium to satisfy an equal number of consistent and inde- 
pendent equations (see equation 1). These are implicitly assumed 
to hold within a certain environment and as of certain data. Some 
parts of these data are introduced as explicit parameters; and, as 
a result of our equilibrium conditions, our unknown variables ■ 
be expressed in function of these parameters (see equation 2^ 
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b The method of comparative statics consists of the study of 
the responses of our equilibrium unknowns to designated changes 
in parameters 1 e we wish to know the properties of the functions 
in (2) In the absence of complete quantitative information con 
cermng our equilibrium equations it is hoped to be able to forma 
late qualitative restnctions on slopes curvatures etc of our equi 
librium equations so as to be able to derive definite qualitative 
restrictions upon the responses of our system to changes in certain 
parameters It is the primary purpose of this work to indicate 
how this is possible in a wide range of economic problems 
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THE THEORY OF MAXIMIZING BEHAVIOR 

Three Sources of Meaningful Theorems 

It will be remembered that in the case of a unit tax on a firm’s 
output it was possible to state unambiguously the direction of 
change of output with respect to a change in the tax rate. This 
was so because of the fact that the equilibrium output for each tax 
rate was derived from the condition that profit must be at a 
maximum. 

As will become evident in the course of our exposition, this is 
by no means an accidental, isolated case; it is merely an application 
of a very general principle of economic method, which lies at the 
bottom of much of economic theory. In fact, aside from those 
parts of economic doctrine whose results are inconclusive — and the 
most ardent advocates of economic theory will admit that this 
includes a large part of accepted analysis — there is not much which 
cannot be brought under this heading . 1 

The general method involved may be very simply stated. In 
cases where the equilibrium values of our variables can be regarded as 
the solutions of an extremum ( maximum or minimum ) problem, it is 
often possible regardless of the number of variables involved to de- 
termine unambiguously the qualitative behavior of our solution values 
in respect to changes of parameters . 2 

It so happens that in a wide number of economic problems it 
is admissible and even mandatory to regard our equilibrium equa- 
tions as maximizing (minimizing) conditions. A large part of en- 
trepreneurial behavior is directed towards maximization of profits 
with certain implications for minimization of expenditure, etc. 

1 The reader may verify this result by paging through any good economics textbook, 
such as Marshall’s Principles, and analyzing the derivation of various theorems stated. 

2 It may be pointed out that this is essentially the method of thermodynamics, 
which can be regarded as a purely deductive science based upon certain postulates 
(notably the First and Second Laws of Thermodynamics). That such abstract reasoning 
should in the hands of Gibbs and others lead to fruitful theorems testifies to the validity 
of the original hypotheses. 
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Moreover, it is possible to derive operationally meaningful restnc 
tive hypotheses on consumers’ demand functions from the assump 
tion that consumers behave so as to maximize an ordinal preference 
scale of quantities of consumption goods and services (Of course 
this does not imply that they behave rationally m any normative 
sense ) 

It is not to be thought that in principle all economic results 
emerge from these maximizing assumptions 3 For, as we have 
seen, it was also possible to deduce conclusive qualitative results 
from certain stability assumptions Nevertheless, many of these 
stability conditions rest implicitly upon maximizing behavior 

Moreover, certain difficulties arise here While of course, it is 
always possible to lay down arbitrary definitions of stability, it 
is impossible to deduce them without the implicit introduction of 
dynamical considerations concerning the behavior of a system out 
of stationary equilibrium Depending upon the dynamical set up 
envisaged, different stability conditions are implied Thus given 
supply adjustments of the type assumed in the cobweb cycle phe 
nomenon, it is well known that the ordinary Marshallian condition 
of a positively rising supply curve may not result in "stable’ 
equilibrium * 

It is true that the identification of equilibrium with a stable 
maximum position m a sense begs the question of stability How 
ever, where these extremum conditions are realized it can be shown 
that many dynamic set ups will give rise to dampened oscillations 
as a result of small displacements The relations between dynamic 
theory and the stability of equilibrium are discussed in the last 
chapters 

There remains still another possibility by which conclusive 
theorems may be deduced We may know in advance certain 
qualitative properties of our equilibrium equations Thus, refer 
ence may be made to alleged technological and psychological laws, 

* Thus the definition of economic theory as the study of scarce means with altema 
Uve uses I would regard as too broad from one po nt of view, and much too narrow 
from another 

* A stationary equilibrium is stable providing the specification of initial conditions 
differing only sightly from the stationary equilibrium values results in an evolution 
which tends (at least in the limit) to approach the eqml bnum values See Part II 
below 
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held to be plausible upon a priori grounds . 5 Even here, as many 
economists have pointed out, the reasoning may rest in the 
last analysis upon certain maximum considerations. Thus, in 
demonstrating the validity of the law of diminishing marginal phys- 
ical productivity we ordinarily point to the fact that firms in pure 
competition are in equilibrium under a given set of factor prices. 
This would not be possible if the law of diminishing marginal pro- 
ductivity did not hold. Again we point to the fact that farmers 
do not devote all their attention to raising grain on one square inch 
of land; they use much land of the same kind, and even inferior 
grades of land. Thus, we really argue backwards from maximizing 
economic behavior to the underlying physical data consistent 
with it. 

There is still another type of problem for which the study of 
maximizing behavior is illuminating. In some cases, as we shall 
see later, it is possible to formulate our conditions of equilibrium 
as those of an extremum problem, even though it is admittedly not 
a case of any individual’s behaving in a maximizing manner, just 
as it is often possible in classical dynamics to express the path of 
a particle as one which maximizes (minimizes) some quantity 
despite the fact that the particle is obviously not acting consciously 
or purposively. 

For all of these reasons the study of maximizing behavior affords 
a unified approach to wide areas of current and historical economic 
thought. There is, moreover, considerable advantage in discussing 
the problem at first in its full generality. The high degree of 
abstractness will be more than compensated for in the ease with 
which numerous applications can be deduced as special cases. 

A Calculus of Qualitative Relations 

Before I enter upon the theory of maximizing behavior, it may 
be illuminating to show why intuition and a general feeling for the 
direction of things does not carry one far in the analysis of a com- 
plex many- variable system, such as is characterized by equation 

6 In a problem of any complexity involving a number of variables, intuition is a 
poor guide for the reasons discussed in the next section. All presumptions become 
dubious. In such cases the economist often falls prey to the perils of assuming the 
equiprobability of the unknown. As a result, every reformulation of the problem results 
in changed presumptions. This is no doubt one reason why every terminological 
revolution in economic thought brings with it a recasting of belief. 
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(1) of the previous chapter To make matters even simpler than 
they usually are m reality let us suppose that we know the quah 
tative direction of movement of each of our equilibrium equations 
with respect to a change in all of the variables and parameters 
Thus we know at least the algebraic sign of each of the first partial 
derivatives of the form /* ’ or/. It does not matter how w e came 
about such knowledge for example we might confidently believe 
that individuals will divide an additional dollar of income frac 
tionally between consumption and saving not in consequence ol 
some economic theory of maximization but simply as a result of 
everyday observation 

What then can we sa> concerning the direction of change of anv 
particular variable in response to a change m some parameter 1 
According to equation (7) of chapter u this is given by a compli 
catcd expression which can be written as the quotient of two n by 
n determinants whose elements are made up of these first partial 
derivatives Now in order for our question to have an immediate 
definite answer we must be able to determine unambiguously the 
signs of each of these determinants From the original funda 
mental definition of a determinant each consists of n 1 terms each 
term involving the product of n elements From our hypothesized 
knowledge we can be sure of the sign but not the magnitude of each 
term Only if all of the « 1 terms happen to be of the same sign 
will the sign of the determinant be unambiguous If our system 
involves ten variables the determinants will have more than three 
million terms Regarded simply as a problem in probability the 
chance that a run of this length should always have one sign is 
about one out of one with a million zeros after it Therefore unless 
some special feature is present we are almost sure to find it neces 
sary to weigh the magnitude of some terms against others which 
is to enter upon quantitative problems admitting no answer b> 
qualitative methods 

Nevertheless let us see how far qualitative methods will take 
us in the slightly simpler case where only one parameter is changed 
m such a manner as to alter only one of the equilibrium relations 
As before the signs of all first partial denvatives are known Con 
sideration of all possible cases that may anse will show that there 
is no loss in generality m assuming that it is the first equation which 
shifted that the partial derivative of each implicit equation with 
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respect to its own variable /,/ is always positive, and that the 
change in the first equation with respect to the given parameter is 
positive. If these conditions are not realized, they can be brought 
about by changes in sign of one or more of the equations or variables 
and the parameter. 

By our qualitative knowledge we are able to specify a matrix 
of the form 


sign// = 


r + 


± • • • 

± 

± 

+ 

± • • • 

± 


± 

+ ••• 

=t 



± • • • 

+ 


( 1 ) 


where the n columns represent (xu • • •, x n ), and the n rows repre- 
sent (f\ • • •, f n ). The sign in each element will represent the as- 
sumed known sign of the partial derivative of the variable corre- 
sponding to that row taken with respect to changes in the variable 
corresponding to that column. By our previous convention, we 
have made all diagonal elements plus. All the remaining elements 
can be of either sign, but in any one case they are assumed to be of 
definite known signs. All in all, there are a vast number of possible 
matrices of this kind which could arise in any problem, in fact all 
together 2 raised to the n(n — 1) power. 

We are interested in the direction of change of our unknows, 
or in the signs of ( dxi/da , • • • , dx n /da). If nothing is known a 
priori, these may take on any one of all the possible 2 n arrange- 
ments of signs. 
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+ ••• 
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+ 
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+ ••• 
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can 

now 

j 

be eliminated 


2 " 


( 2 ) 


basis of our qualitative knowledge as embodied in the specification 
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of the matnx (1)? Ideally, we should hope to be able to rule out 
all but one of the possible combinations so as to get a unique answer 
However, it would be at least desirable to be able to determine the 
sign of at least one of our unknowns, or, what comes to the same 
thing to be able to rule out a particular half of all the possible 
combinations x ^ 

So much for our aspirations, turning to the exact procedure by 
which we rule out a combination, we find ourselves m for dis 
appointment If we substitute into equation (6) of chapter n the 
signs of the indicated elements, we shall be able to rule out a 
combination if and only if it leads to a contradiction i e if it does 
not add up to zero, or to a negative number as it should 

Concretely, this can be seen to mean that we can rule out any 
one of the above combinations of sign in (2) which exactly duph 
cates any of the last ( n — 1) rows of (1), or which is the exact 
antithesis of any of these same rows Otherwise, one of the last 
(« — 1) equations of (6) chapter n could not add up to zero as 
required by the present hypothesis In addition we may rule out 
any combination of (2) which exactly duplicates the first row but 
we are not able to rule out its antithesis {o 64s 2_ 

All told then we are at most able to rule out by qualitative 
considerations only (2 n — 1) combinations out of a grand total of 
2" possible combinations Even for moderate sizes of n this leaves 
us with all but a minute fraction of possibilities And this is the 
largest number that we can rule out on such a hypothesis In 
many cases we are not able to rule out even this many Thus, if 
any two rows in our original matrix have exactly the same signs, 
they will both rule out the same combinations which cannot there 
fore be added together for fear of double counting 

It can be seen then that purely qualitative considerations can 
not take us very far as soon as the simple cases are left behind 
Of course if we are willing to make more rigid assumptions either 
of a qualitative or quantitative kind we may be able to improve 
matters somewhat Ordinarily, the economist is not in possession 
of exact quantitative knowledge of the partial derivatives of his 
equilibrium conditions None the less if he is a good applied 
economist, he may have definite notions concerning the relative 
importance of different effects, the better his judgment in these 
matters the better an economist he will be These notions which 
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are anything but a priori in their original derivation, may suggest 
to him the advisability of neglecting completely certain effects as 
being of a second order of magnitude. In other words, zeros are 
inserted into the matrix of (1). In fact, the so-called method of 
partial equilibrium consists of nothing more than a liberal sprinkling 
of zeros into the equations of general equilibrium. In the hands 
of a master practitioner, the method will yield useful results; if not 
handled with caution and delicacy, it can easily yield nonsensical 
conclusions. 

By simple extensions of the above argument we can show how 
the presence of a zero in any row permits that row to rule out four 
instead of only two combinations. Similarly, r zeros in a row per- 
mits that row to eliminate 2 r+1 combinations. As before, there 
may be duplications in the eliminating influence of different rows. 
It is also apparent that definite knowledge concerning the sign of 
any one of the variables will enable us to rule out one-half of the 
original number of combinations, or 2 71-1 combinations. Because 
of duplicating effects, definite knowledge of the signs of the changes 
in two of the unknowns will enable us to eliminate less than twice 
as many as one unknown; actually, by two known signs we can 
eliminate 3(2 n_2 ) combinations in all. Definite knowledge of the 
signs of k unknowns will permit the elimination of all but 2 n ~ k 
combinations. 

It would be possible to illustrate the above calculus of quali- 
tative relations by reference to a number of well-known economic 
problems. Space will permit brief mention of a few. In the 
market illustration of the previous chapter 6 there was considered 
the case of a simple Marshallian partial equilibrium market in- 
volving two unknowns, price and quantity, whose equilibrium 
values are determined by the intersection of supply and demand 
schedules. We may apply our analysis to the determination of the 
changes in our variables resulting in an assumed shift of a nega- 
tively inclined demand curve. If the supply curve is known to be 
positively inclined, the sign matrix can be written in the form 
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In this case, we are able to rule out the maximum number of com 
binations, (2 n — 1), or three in all Since there are only four 
combinations we are left with a unique answer, as is apparent from 
the algebraic analysis of the previous chapter If the supply curve 
is assumed to be negatively inclined the signs m the second row 
both become plus, and we are able to rule out only the minimum 
number possible, or two We are left then with a final ambiguity 
which cannot be settled except by quantitative knowledge or by 
means of various stability hypotheses 

A more illuminating and more difficult example is that of the 
simplified Keynesian system, described in greater detail in chapter 
ix Without going into details here, it can be stated that this gives 
rise to a system of three variables, interest rate xj, income x 2 and 
investment, x 3 whose equilibrium values are determined by three 
equations, liquidity preference, f 1 , marginal efficiency schedule, f 2 , 
and propensity to consume P Making the usual assumptions 
concerning the signs of the various first order effects, e g that 
investment vanes inversely with the rate of interest and that the 
effect of interest on consumption is of an order of magnitude which 
can be neglected, we end up with a sign matnx of the form 

'+ ~ O' 

(sign//) = - + - (4) 

. 0 -+. 

We may now consider a shift in any one Of the schedules e g 
a change in the liquidity preference schedule brought about by 
some policy measure If we apply the above calculus we are still 
left with three possible combinations for (sign change in *.) 

namely, (4- 4- 4-), ( h 4-),or( ) In the case of a shift 

of the marginal efficiency schedule, we narrow the choice down to 

two, (4-4-4-) or ( ) For a shift in the propensity to 

consume schedule we end up with three choices, (4-++)» 
(+ H ), or ( ) 

It will be noted that in none of the cases arc we able to make a 
definite statement about even one of the variables In the second 
case, which is the most favorable, we can only- say that an increase 
in the marginal efficiency schedule will either raise interest, income, 
and investment, or else lower all three of them This is a highly 
unsatisfactory state of affairs particularly since it seems on the 
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face of it doubtful that an increase in the marginal efficiency of 
capital should lower interest, income, and investment. But it is 
only by bringing in the stability considerations of later chapters 
that a more definite, and more reasonable deduction can be made. 
In the discussion in chapter ix it is shown that stability hypotheses 

will leave us with only one combination for the first case, ( f -f) ; 

for the second case, ( + H — b) ; and for the third case the two possi- 
bilities, (-1 — 1 — b) or (-1 — b — ). This last ambiguity is irremov- 
able unless still stronger quantitative assumptions are made. 

Numerous other economic examples could be mentioned, but 
these may be left to the reader. 7 Before returning to the problem 
of maximizing behavior, I should like to call attention to the fact 
that the qualitative calculus is not invariant under transformation 
of the variables. 

Maximum Conditions of Equilibrium 

Let us consider a new variable, 0 , defined by a single-valued 
function of our previous variables, 

0 — /(x 1 , ' , Xnj 0!i, * 1 tXm) t (o) 

where/ and its partial derivatives of at least second-order exist and 
are continuous in a wide region. If for any preassigned values of 
the a’s, there exists a set of values for the .t’s, (xj°, • • ■ , x n °), corre- 
sponding to which 0 is at a maximum, then we must have 

J(xi, - - • , x n , a®, • • - , a m °) g f(x 1 °, • - • , x n °, ai° , • • • , a m °). (6) 

As a matter of notational convenience this may be written 

f(X, a°) ffj(X\ «°), (7) 

7 One last illustration taken from the field of international trade may be dealt with 
briefly. Professor Leontief has produced a numerical example illustrating the possibility 
that a unilateral payment from one country to another may so shift the terms of trade 
in favor of the paying country as to cause it to be better rather than worse off as a result 
of the transfer. “Note on the Pure Theory of Transfer,” in Explorations in Economics 
(New York, 1936), pp. 84-92. The example is carefully framed so as to guarantee 
indifference curves of the proper curvature for both countries. However, if one sets up 
an analytical system, along the lines of the numerical example, one will find that the 
Leontief Effect can only happen for a system in which an increase in demand for a com- 
modity lowers rather than raises its price. If the latter phenomenon is ruled outas being 
anomalous, or incompatible with stability (defined arbitrarily or in terms of a dynamic 
set up), then we can by the same action rule out the possibility of the Leontief Effect. 
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where X stands for the arguments (*,, , *„), and a stands for 

the arguments (a t , , a m ) 

If z° represents an absolute maximum with respect to all ad 
missible values of our independent variables, then 

/(*,««) </(*°,«°) (8) 

On the other hand, z° may merely represent a maximum relative to 
all x's lying in some restricted neighborhood of the point (X°, a 0 ) 
We know from Mathematical Appendix A, Section II that in 
order for 2 ° to enjoy a relative maximum it is necessary that 

!^“0=/*(*i 0 . . *»°. «i°. . «m°), (**=1, ,n) (9) 

and 

EZ/.^/SO, (10) 

i i 

where the h’s are arbitrary numbers For a regular relative maxi 
mum the last condition can be written 

£ S/WM, < o, (11) 

1 1 

not all h's equal to zero In other words, this symmetrical homo 
geneous quadratic form must be negative definite (See Mathe 
matical Appendix A, Section II ) 

Displacement of Equilibrium 
The set of equations in (9) can be regarded as our conditions of 
equilibrium corresponding to the set (1) of the second chapter, and 
may be assumed to yield an explicit solution for our unknown equt 
librium values m terms of the preassigned parameters 8 

x % ° = g‘(ai°, ,a«°) (* = 1, , «) (12) 

Using the method of the previous chapter we may easily solve 
for the rates of change of our solution values with respect to the 

• From the implicit function theorem it is known that assuming the definiteness of 
the quadratic form throughout the whole region under discussion will insure us of the 
uniqueness of our relative maximum position Also our maximum conditions are essen 
tially invariant under any non singular transformation of variables Furthermore any 
properly monotonic function of w enjoys an extremum position for the same values of the 
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ktb. parameter by means of the following : 


y f o _ 
i ■ hiXi da k . 


f 0 

Jx i a k j 


(i = !,•••, n) 


(13) 


where of course, 


dxj 

da k 


ga k : i.<xi°, , a m °). 


U - 1, ■••,») 

(£ = 1, • • •, m) 


(14) 


As in (8) of chapter ii, our solution may be written in the de- 
terminantal form 


where 


dXj 

doth 


ZMHi, 
Tl ’ 


f 0 - • • f ° 

j x i*i j x i x „ 


H = 



f o ... f 0 

J x n x 1 J x n*n 


(15) 


(16) 


and Hij is the cofactor corresponding to the element of the fth row 
and the jth column of the Hessian H. As will appear later, we 
are able in a large class of cases to evaluate the algebraic sign of 
this expression. In what follows where there is no risk of am- 
biguity, I shall omit the superscript zero. 

First let us derive a relationship of complete generality. Mul- 
tiply the it h equation in (13) by dxi/da k respectively to get 


dXi ” dxj _ _ . dxj 
da k i ^ x<x ’ da k r ‘“* da k 


(i =!,•••,«) (17) 


Summing with respect to i over all equations we have 


But from (11) 


f f , dXi dXj _ " , dXj 

V i JxiX ’ da k da k da k ' 


Z ZU/iihj < 0, 
1 1 


holding for any h’s. In particular, for 

dxi 


= 


da k 


(18) 
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or from (18) 




71 cfoc 

F*'S = >0 


(20) 

( 21 ) 


for not all dx /doa equal to zero In words this compound term 
consisting of the weighted sum of our unknown rates of change 
must definitely be positive in sign However this does not add 
very much to our knowledge since we do not know which terms 
will be positive 

As mentioned earlier however we are not always interested in 
parameters which shift each and every one of our equilibrium con 
ditions For this would necessitate a knowledge of the relative 
quantitative importance of each shift before we could hope to 
evaluate the composite result For this reason we often narrow 
our problem by considering parameters which cause only one of 
our equilibrium equations to shift Let us restrict our attention 
therefore to a set of such parameters (a-i «„) equal in number 
to our unknowns We may number each in the order correspond 
mg to the respective equilibrium equation which it shifts Then 
since a change in the £th parameter must leav e all other equations 
unchanged we have 

/*„ t = 0 for j*k (22) 

Now our inequality of (21) reduces to the more simple and more 
readily applicable condition 

/~sl > 0 (23) 

In words for the class of parameters now under consideration the 
rate of change of the ith variable with respect to its corresponding 
parameter must be of the same sign as/ Xt ,, t which in turn is positive 
if the shift of the equilibrium equation is in the direction of an 
increase in x * 

This can be verified by the computation indicated in (15) 
According to our present hypothesis 

3Xj /jjajffu 

3«i H 


(24) 
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In Mathematical Appendix A, Section III, it is shown that 

< 0 , (k - 1 , ■••,») 
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(25) 


as a condition for a true maximum. Hence, 


or 



(26) 

(27) 


Let us examine more closely the nature of our hypothesis em- 
bodied in (22) that each parameter shifts but one condition of 
equilibrium, leaving all others unchanged. In the first place this 
does not mean that a change in the it h parameter results in a 
change in the ith variable alone. On the contrary, a change in 
any parameter will typically result in a change in all variables. 
Our hypothesis merely says that this must come about through a 
shift in but one schedule with movements along the remaining 
schedules. 

As I shall later argue, the assumption of this hypothesis does 
not involve any serious loss of generality and still includes the vast 
majority (in fact, it is hard to find exceptions) of relationships 
contained in current economic theory. 

The most general function for which the partial differential 
equations of (22) hold may be written 

z — 6 (x x, • • • , x„) + B l (x i, <xi) -j- B 2 (x 2, «2 ) d f- B n (x„, a n ). (28) 

This may be verified by successive differentiation to obtain 


dh _ _ d 2 B> 

dXjdctk x ‘ a> - dXjdak 


for j ^ k. 


(29) 


For the rest of this chapter unless otherwise indicated I shall 
consider functions of this restricted type. The inequality of (21) 
still holds in any case, but is less immediately applicable. After 
all, generality is not an end in itself. A theory may be so general 
as to be useless. It is for simple theories which have wide appli- 
cability that we must look. 
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Displacement of Quantity Maximized 
We have Seen how the equilibrium quantities ( x x *„) 

change when parameters vary Any function of them also vanes 
m a determinable way In particular the quantity to be maxim 
ized 2 will change and its laws of change take on a very simple 
form Let 

2 - /(*i x* cc) (30) 

where a is any parameter Let the re s be functions of a determined 
by a maximum position i e by 


q- =/(x i x„ a) = 0 (* = 1 n) (31) 




That is the first order change m z is exactly equal to the change 
in 2 when the x s are not varying optimally so as to keep z at a 
maximum only to terms of a higher order is there a difference m 
the way z is changing A similar relation can be shown to hold 
m the case of a constrained maximum 

This is the familiar relation of tangency between the envelope 
of a family of curves and the curves which it touches It has many 
economic applications of which only a few need be mentioned In 
the famous dispute between Professor Viner and his draftsman 
Dr Wong the question arose as to the correct relationship between 
long and short run curves 9 The short run curve is drawn up by 
minimizing total (and average) costs for each and every output as 
of gtiert amounts of some fixed factor The long run curve requires 
lowest total (and unit) costs for every output as plant is optimally 
adjusted But according to our theorem a change in the parameter 
output will to the first order result in the same change in total (and 
amt/ costs' wiVerr piVnrf ifxetf as wibnr plstet iw op&wraAy Af/awAaf 
Dr Wong was right then in insisting upon tangency 

Another example is provided by Professor Viner 10 Let labor 
marginal cost be the rate of increase of total costs when only labor 

•J V ner Cost Curves and Supply Curves Ze tschf ft fur Naltonalokonomte III 
(1932) 23-16 

• J V ner Stud « m the Theory of In ernal onal Trade (New York Harper 1937) 

S S16 
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is varying so as to produce the extra output. It is to be dis- 
tinguished from marginal cost of labor which appears in monopsony 
theory, and the marginal labor cost which means the increase in 
one component of costs as all factors are optimally varying. Let 
marginal cost be the rate of increase of total costs with respect to 
output as all factors vary optimally. Then our theorem states 
that labor-marginal cost equals marginal cost equals any other 
factor-marginal cost. As Professor Viner has pointed out with 
great insight, at the margin (i.e., neglecting differential coefficients 
of higher-orders) all factors are perfectly indifferent substitutes. 
The classical economists, lacking the precise notion of an infini- 
tesimal, were forced to employ the concept of a broad extensive 
margin. (Viz., Ricardo’s no-rent land, and J. B. Clark’s famous 
zone of indifference.) 11 

For finite movements, however small, higher-order terms will 
make the change in the quantity maximized (minimized) different 
when the unknowns are optimally adjusted from the change when 
they are held constant. Actually, 


d 2 z _ ” . d?Xi ” dxi d(fj ) ” dxj 

da 2 “ \ Ji da 2 + r da da ^ 2 t Jia da +Jaa 

n fjY . 

= 0+0 +Zf i aj£+faa, 


(33) 


since /i = 0. 

The higher-order change in z when all x’s are held constant is 
given by 



(34) 


The difference between the former change and the latter is as 
follows: 


d 2 z d 2 z _ " , dxi 
da 2 da 2 1 ' a da 


(35) 


because of equation (21) above. 

If the changing parameter affects profits or ordinal utility ad- 
versely, it does so less when output and consumption are optimally 
adjusted to the new circumstances. If the parameter improves 

u For still other examples see Marshall’s Principles, Mathematical Apper 
Note XIV, pp. 846-8S2. 
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profits (etc ) it does so most when the unknowns are optimally 
adjusted 

One could go on to still higher terms These would be seen to 
depend upon the \anous partial derivatives of / and upon dx /da 
d 2 x / doi 8 etc The changes in z are of higher order than the changes 
in the x s In fact the nth derivative of z depends at most upon 
the (» — l)th derivative of the xs This is brought out m the 
above equations 

Auxiliary Constraints and tiic Generalizld 
Le Chatelier Principle 

If the equilibrium of a system is determined by extremum con 
ditions where all unknowns are independently variable the addi 
tion of auxiliary constraints (satisfied by the equilibrium position) 
will leave the equilibrium unchanged If a true relative maximum 
is attained a movement in any direction leads downhill a fortiori 
movements along certain subsets of directions will be downhill 
The usefulness of what may at first seem a strange process lies in 
the fact that it enables us to deduce necessary conditions which the 
equilibrium must satisfy 

Thus with plant fixed a variable factor will be hired until its 
wage (marginal cost) equals its marginal value productivity In 
the long run plant cannot be taken as fixed Nevertheless the 
short run condition holds in the long run as well (but not vice 
versa) since long run total costs cannot be at a minimum unless 
short run total costs are as low as possible Again in discriminat 
mg monopoly the condition that any given output be divided 
optimally between two markets (equal marginal revenues) holds 
even when total output is not given but is determined by cost 
considerations 

How is the equilibrium displaced when there are no auxiliary 
constraints as compared to the case when constraints are imposed ? 
How does the demand for a factor vary with its price when other 
factors cannot be optimally adjusted because of time lags etc 7 
This type of question is important in thermodynamics as well as 
in economic systems It admits of a simple answer 

Let z be of the follow mg special form 

z - 0(*! Xn) — <* 1*1 — a — 


— a„X „ 


(36) 
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If all unknowns are independently variable, a maximum is deter- 
mined by the equilibrium conditions 


~ = 0i(x 1 °, • • • , *„°) - at = 0, (i = 1, • • • , n) (37) 
and 

m = im 

is the matrix of a negative definite form. How does x,- change when 
its “conjugate” parameter a ; varies? 


dXj _ iJ ti 
(/a,- _ ~R 


(38) 


Let r independent additional linear constraints be imposed such 
that 


E gf(xj - x,°) =0, (0 = 1, • • •, r) (39) 

1 

where 

[g/J 

is of rank r. 

The introduction of constraints requires that our equilibrium 
system be modified to assume the following form 


0* + E ^Pgf ~ a i ~ 0) (f — 1 ) ' ‘ ' 1 w) 

1 

E gKxi ~ x*) = 0, 08= 1, 


(40) 


where the X’s are Lagrangean undetermined multipliers. 

Let T Ii represent the determinant formed by bordering H with r 
rows and columns consisting of the coefficients gf; i.e., 


T H = 


| M g±L 
gf 0 


(P,7 = 1 , ■■■>>') 


(41) 


Then the change in Xi with respect to ai when r auxiliary con- 
straints are imposed, is given by 

(42) 


f dxA _ r j 
\dai)r T 


Hu 

H 


< 0 


x ’ r ■ 

if [H~] is the matrix of a negative definite form, and the matrix 
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3% is neKative semi-definite. (Cf. Mathematical Appendix A, 
Section IV.) Adopting the convention that 


H = «//, 


equation (38) fits in as a special case of (42). 

What is the effect upon the rate of change of r, with respect to 
a< of adding an additional constraint? Clearly 

_ ( £[*f\ n T H, ( -»//„ 

\dajr \dcnjr-i “ '// -1 H 


r H„ ■ 
T H ~ 


'H K+T . 


T H r H* + 


'H'Hn+r , 


(43) 


by a well-known theorem on determinants (Jacobi). 

The denominator is positive because such bordered principal 
minors regardless of the number of bordering rows must be of the 
same sign ** Hence, the difference is positive 
We have the following general theorem. 



While the change in an x with respect to its own parameter is 
always negative regardless of the number of constraints, it is most 
negative when there are no constraints, only less so when there is 
a single constraint, and so forth, until the number of auxiliary 
constraints reaches the maximum possible, namely (n — l). ,s 

This explains why economically long-run demands are more 
elastic than those in the short-run. A lengthening of the time 

“See Mathematical Appendix A equation (48) It should be noted that our r 
corresponds to m there, and r there corresponds to n here 

11 This is a purely mathematical theorem It corresponds to some of the phenomena 
which fall under the heading of the celebrated principle of Le Chatelier Because of the 
almost metaphysical vagueness of its formulation, the latter s meaning is often in doubt, 
and it is used at one and the same time to cover diverse phenomena The above formu- 
lation explains why the change in volume with respect to a given change m pressure is 
greater when temperature is constant than when entropy is held constant and tempera 
~ w Permitted to vary in accordance with the conditions of equilibrium 
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period so as to permit new factors to be varied will result in greater 
changes in the factor whose price has changed, regardless of whether 
the factors permitted to vary are complementary or competitive 
with the one whose price has changed. 


Economic Illustrations 


It is not hard to point out the relevance of the above for a large 
number of economic problems. First, let us repeat the result of 
our previous analysis. 


t a** > o 

Jx <°‘ dai > U - 


(45) 


This states that the direction of the change of the ith variable 
with respect to its corresponding parameter is of the same sign as 
/*,«*,( = Bx.a*). This quantity may be taken as a criterion. If its 
algebraic sign is definite, i.e., unambiguously determinable, then 
the sign of dxi/dcn is also definite. 

We have merely to show, therefore, that a wide variety of eco- 
nomic problems can be so formulated as to yield a conclusive de- 
termination of the sign of our criterion. 

It may be noted that for maximum problems involving a single 
variable no restrictions need be placed upon z in order that our 
criterion be applicable. Let 

3 = f(x, a). (46) 

Then our criterion becomes 

/~fs >0 - («) 


I list below some random examples chosen to illustrate the 
applicability of the criterion. 

(a) Let us go back to the example of the second chapter of the 
effect of a unit tax on output. It will be remembered that the 
relationship between the equilibrium output and the tax was de- 
termined to be unambiguously negative. This conclusion can be 
quickly obtained by our present method. Profit is defined as 
follows: 

t r = <p(x, t ) = \jcp(x) - C(x)] — tx. (48) 

Our criterion may be easily computed. 

- 1 . 


TTxt — <Pxt — 


(49) 
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Therefore, 

T 

V 

o 

(50) 

or 

dx ^ n 



at < 0 

(51) 


Also, as shown in the previous section, 

dir dx 

dt ~ at " “ * 

To a first approximation, the change in profits is unaffected by 
the adjustment of output To a more accurate approximation, 
profits are reduced less if output is optimally altered 

(b) The effects of three other kinds of taxes are also simply 
derivable Consider respectively a percentage tax on gross sales, 
a lump sum tax, and a percentage ta.x on profits The correspond 
mg profit functions may be written 

ir = [*/>(*) - C(x)] - l'xp{x), 
r - [*£(*) - C(*)] - l” (52) 

r « [xp{x) - C(x)] - t"'[xp{x) - C(*)] 

Our criteria are respectively 

t„ = -£o/>m]<o, 

i„ - o, (51) 

djr(x, l'”) n 

" dT~ ~ 0 


Obviously, therefore, for the first case of a tax on gross sales, the 
effect of an increase in the tax rate is to reduce output The last 
■two cases ixowever^ present a new feature Our cn tenon is neither 
positive nor negative, but equal to zero A little reflection reveals 
that our equilibrium maximum conditions are essentially inde- 
pendent of the parameters changed Hence, our equilibrium 
values remain unchanged, i e , 


dx ^ 
dt ' - 


0 , 


dx 
m " 


o 


( 54 ) 
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These conclusions are, of course, familiar from the Marshallian 
analysis. 

(c) Let us now consider a problem which has nothing to do with 
taxation, but which figured prominently in the famous cost con- 
troversy of a few years back. Let us suppose a firm in pure com- 
petition, i.e., one which can sell as much of its output as it wishes 
without affecting the price. Given total cost in function of output, 
there will be a determinable output reaction to each given price. 
What is the nature of this dependence? From the fact that the 
firm is assumed to be in stable equilibrium when it enjoys a proper 
relative maximum, we can easily apply our criterion to deduce the 
properties of the supply curve. Here 


7T = <p(x, p) = px - C{x), 

dir 


dx 


= p - C'(x) = 0. 


(55) 

(56) 


Equation (56) can be solved to determine output in 

function of 

price. 


* = g(Pb 

(57) 

It is easily verified that 


TT X p = <Pxp = L 

(58) 

Therefore, 


| = I'M > °- 

(59) 


Even if we relax the requirement that a regular relative maximum 
be realized, it still remains true that the supply curve cannot be 
negatively inclined. Of course, this does not mean that the mar- 
ginal cost curve cannot be negatively inclined, but merely that in 
such ranges it cannot serve as a supply curve. 

(d) It must not be thought, however, that the assumption of 
our equilibrium as the solution of a maximum problem is the “open 
sesame” to the successful unambiguous determination of all possible 
questions which we may ask. For it is extremely easy to specify 
simple and important problems which cannot be answered even 
qualitatively without further knowledge. 

Consider the problem of the effect of the introduction of adver- 
tising expense upon the output of a monopolist. Will an increased 
advertising expense result in a larger or smaller output? Here 

tt = <p(x, a) = R(x, a ) — C(x) — a, (60) 
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Under discriminating monopoly all prices are regarded as inde- 
pendently variable and are adjusted so as to maximize profit. 
Profit may be written as 

tt = piD\p x ) + p«D*(P*) ~ C[p i {px) + D*{pP)-]. (65) 

For simplicity we rule out the possibility that all of the demand 
at a given price in a given market may not be satisfied by the 
entrepreneur. The removal of this restriction can be easily made. 
Here the conditions of equilibrium are 


&-0 + 

^ = 0 -ZH(W+Ch-C0^. 


( 66 ) 


This results in an optimal set of prices (p x 0 , p»°) and quantities 
(xi°, .%' 2 °), and total quantity (X°), or (xi° + x 2 °). 

In the case of simple monopoly we impose the condition upon 
our problem that the price be equal in both markets. Thus, 

pi — pi — P, (67) 

TT = PDKP) + PDKP ) - C[D\P) + DKpn (68) 

|-0= DHt) + DKP) + (P - CO ( : % + %)■ ■ (69) 

This yields as a solution ( p r , p'), (x/, x 2 '). and ( X '). Is it possible 
to determine whether 

X' ^ A 0 ? (70) 

At first glance there is little resemblance between this and the 
examples hitherto discussed. No data have been introduced as 
parameters; furthermore, we are not comparing two situations in- 
definitely close to one another. On the contrary, we seem to be 
dealing with two entirely different kinds of behavior. The two 
solutions seem to result from qualitatively different types of maxi- 
mum problems. Nevertheless, we may avail ourselves of an arti- 
fice by means of which we can bring to bear the methods used above. 

Let us introduce a parameter defined as follows: 
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We may now treat as independent variables (p t k) instead of 
(Pi P ■») since there is a one to one relationship between the two 
sets Therefore 


t= F(p, p) = F(Pi kp x ) = G(p k) (72) 

In the case of discriminating monopoly both prices are varied indc 
pendently so as to maximize profit This is equivalent to the 
condition that both pi and k must be allowed to vary so as to 
maximize profit Our conditions of equilibrium arc 


dw 

dpi 

dir 

dk 


dG 

dpi 

dG 
dk '' 


(73) 


I Ins must of course determine the same solution as given in (66) 
since we have only transformed our variables 

For simple monopoly it is a preassigned condition of the prob 
lem that both prices be the same or that k be equal to unity 
Thus by continuous movement of k from unity to p2°/Pi° we may 
pass from simple to discriminating monopoly If then we could 
unambiguously state the direction of the rate of change of output 
with respect to k at each point it would be possible to determine 
whether output would be larger or smaller 

In order to ascertain the rate of change of total output with 
respect to k let us still further transform our variables as follows 

k =£ 

» (74) 

X = £>■(? ■) + 0 ! (P>) 


Solving inversely we get 

pi = f*(k X) 

Pz = P(k X) 

Thus 

r = F(p, P ) = FUKk X) r(k A-)] = p(* X) 
Our equilibrium condition for a given k is 
dir _ dtp(k A) _ « 
dX - dX 


(75) 

(76) 


( 77 ) 
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According to our criterion 


<PXk 


dX 

dk 


> 0 . 


(78) 


Now the computation of while laborious, is nevertheless 
possible. It will be found to depend in a complicated way upon 
the curvatures of our demand curves. Its algebraic sign will be 
ambiguous, as must therefore be the effect upon total output. 
However, anyone who cares to go through the computation can by 
so doing determine what additional restrictions must be placed 
upon the demand functions in order to insure that output be larger 
or smaller. 

(f) In later chapters the methods outlined here will be sys- 
tematically applied to different branches of theory. In order to 
illustrate that it is possible to derive unambiguous conclusions even 
with a large number of variables a small part of our later analysis 
will be anticipated here. 

Let us consider the demand of a firm for a factor of production. 
We assume as given a production function embodying the tech- 
nological relationships between inputs and output, the demand 
curve for the finished product, and the prices at which all factors 
of production can be bought in unlimited quantities. The condi- 
tion that profit be at a maximum is sufficient to determine the 
value of all unknowns in terms of these data. Thus, 

i r = total revenue — total factor cost 

= R(x ) — ( Widi + w 2 o 2 + • • * + w„c n ), (79) 

where Wi and o,- are the respective prices and quantities of the it h 
factor of production. But we are also given a production function 
(assumed continuous), 

x = x(a } , ■ • •, o„). (80) 

Hence, 

* = /(®i, •••,<*«, wi, •■•,«'„) 

= R[x{a u • - -, o„)3 — (wiCi + • • • + w„a„). (81) 

Recall from (21) our generalized criterion. 

ddj 
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Obviously here 

Hence, 

— 0 (* 5^ j) 

„ dd% 


(83) 

But 

> 0 (* = ! . 

,n ) 

(84) 

So 

m-l (t = 1, 

. ») 

(85) 


< 0 (i = l 

dlV l K 

,n ) 

( 86 ) 


This conclusion holds for any number of factors 

The above examples are but a small sample of economic prob 
Iems which can be regarded as determined by the solution of 
maxima problems For these, the criteria outlined above may be 
employed to deduce meaningful unambiguous qualitative theorems 

Analysis of Finite Changes 
Up until now the analysis has been confined almost exclusively 
to the determination of the algebraic signs of instantaneous rates 
of change One cannot leave the matter here, for in the world of 
real phenomena all changes are necessarily finite and instantaneous 
rates of change remain only limiting abstractions It is imperative 
therefore that we develop the implications of our analysis for finite 
changes Fortunately, despite the impression current among many 
economists that the calculus can only be applied to infinitesimal 
movements this is easily done 

Let us consider for simplicity a functional relationship between 
one variable, x, and one parameter, a continuous and twice differ 
entiable everywhere on a given interval 

* = g(“) o S a b (87) 

Suppose that we have already ascertained the fact that the alge 
braic sign of the instantaneous rate of change of this function is 
everywhere negative on the defined interval, 1 e , 

^ = g '(o) <0 a g o £ b ( 88 ) 

da & v 

It follows then that any finite change m a within this interval will 
accompanied by a finite change in x in the opposite sense 
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For let 

Aa = a 1 - a 0 , 

Ax = x 1 - x° = g(a*) - g(a°). u< a »<J 
We wish to prove that 

AxAa 0. Aa 5 ^ 0. 

By the theorem of the mean 

Ax = g(a J ) - g(a°) = f 1 g'{a)da = g'(e)Aa, 

J crO 

where 

e = a 0 + 0 (a J - a 0 ). 0 < 0 < 1 

But since the derivative g' is negative everywhere on the defined 
interval, it must be negative for a = e. Therefore, Ax and Aa are 
of opposite sign, for 

AxAa = g'(e)(Aa ) 2 < 0. (92) 

This result is intuitively obvious. If, starting from a given 
point, a curve is always falling, then one should not expect it to be 
higher after a finite distance, continuity being assumed. 

The above proof rested on the assumption that the derivative 
be unambiguously negative everywhere. This is often an ad- 
missible hypothesis as we have seen. But what about the cases 
in which the derivative is known to be unambiguously negative 
only at a point? Thus, 

g'(a°) <0. a< a 0 < b (93) 

Can anything be said then about finite changes? Our answer is 
in the affirmative. It can be shown that for all finite changes in a 
smaller than some assignable quantity, there exist corresponding finite 
changes in x of the opposite sense. 

For by hypothesis g'(ct) is continuous on the given interval, 
and of course at a 0 . Hence, from the elementary definition of 
continuity there exists a neighborhood around a 0 in which g (or) 
is always negative, i.e., 

g'(a) < 0. I a-a°\<h (94) 

Thus, from our previous theorem 

AxAa = (x — x°)(a — a 0 ) <0. |a — a°|< h (95) 


(90) 

(91) 
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The implication of the last theorem is important In order 
that the results of all finite movements be unambiguous, it is 
necessary that the sign of the instantaneous derivative be definite 
everywhere For let the derivative change sign, and it will be 
possible to find contradictory finite movements 

Thus far we have assumed that our equilibrium equations are 
such as to enable us to solve for our unknowns uniquely in terms 
of our parameters Under what conditions will this be possible? 
It will be readily seen that even where our equilibrium conditions 
are defined as the result of a maximum problem there remains the 
possibility of multiple relative maximum positions Under what 
conditions are we able to get a unique explicit solution of our 
implicit equations? What shall be done if multiple solutions are 
possible? 

Assume the implicit equations 
/* = f x (xi x„ a* , a«) =0, (t — 1, , n) (96) 

and a set of values (xi°, , x*®, af, , a m ° ) satisfying these equa 
tions It is known from the Implicit Function Theorem 14 that 
there exists one and only one explicit solution of the form 

Xi = g’(c*i a n ) (* = 1, n) (97) 

in a region around (x v ° x n °, a.\ t , am 0 ) where the following 
functional determinant does not vanish 

A.*. /*,*, Art 

Art Art A*. 

//- 1 /„,|- ^0 ( 98 > 

/.... 

Since we shall be considenng chieffy the sefection oi regui'ar 
maximum positions, we may assume 

(_ = (_ 0 (99) 

at the point of equilibrium Providing this expression remains 
positive everywhere we may be sure of a unique equilibrium Of 
course this is sufficient but not necessary 

' See any \dvanced Calculus 
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Suppose; however, that the Hessian does change sign a finite 
number of times within the region of economically admissible 
values. Then the functions in (97) will be multiple-valued with a 
finite number of branches. Some of these may be ruled out im- 
mediately as not constituting maximum positions, namely those 
for which 

(- 1 YH < 0 . ( 100 ) 

It is possible to choose from the remaining branches only by re- 
ferring back to our original maximum problem. Let (xd, • • - , xf ) , 
(xx 2 , • • •, x n 2 ), • • • correspond to a preassigned set (an, ■••,«„) as 
multiple solutions of (96). That set (or sets) is retained for which 
/ is greatest. This will ordinarily serve to define our x’s as single- 
valued functions of the as except at a finite number of points. 
At these points it is a matter of indifference which of the possible 
alternative solutions is embraced. 

Broadly speaking this possibility of multiple equilibrium offers 
no serious difficulty. For all qualitative results remain. To illus- 
trate this let us consider the problem in full generality. Let 

2 = /(x 1 , • • •, x n , on, ■••,«„) = 0(xi, • • •, x n ) + B l (x 1 , on) 

+ 5 2 (x 2 , an) + •••-+- B”(x n , cc n ). (101) 

Consider a preassigned set of values of our parameters (an 0 , ■ • ■ , a„°) , 
and a corresponding optimal set of values for our unknowns 
(xi°, • • • , Xn ) , not assumed to be necessarily unique. Then by 
definition of a maximum 

fix 1 , • • • , x n , ai °, • - • , a„°) =i /(x,°, • • • , x„°, a! 0 , • • • , a* 0 ) (102) 

where (xi, • • • , x„) take on any values. For brevity this may be 
written 

f(X, a") - f(X°, a 0 ) ^ 0. (103) 

Consider any other preassigned values of our parameters (on , 
• ", an, 1 ), and a corresponding optimal set (xi 1 , • • •, -Xn 1 )- Then 

f{X\ a 1 ) - f(X°, a 1 ) S 0, (104) 

f(X°, a 0 ) - fix 1 , a 0 ) g 0 . 

Adding both sides, we get 

[/(X 1 , a 1 ) - fiX°, cd)] - lf(X\ cd) - f{X°, « 0 )] ^ 


(105) 
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From the definition of a definite integral this can be written 

£ f it CM*. «i*, • • •, «„') 

~-M* i>‘ «i°, •••,a„ 0 )3fe, SO. (106) 

From the same definition this can be further transformed 


? ?/. X. ■■■• 


a:-., a, t ■ • a n )dXjdaj ^ 0. (107) 


But from (101) 

e 0, (* ^ j) 

/*(•( e 

so 

? X« X-* /*.•<(*!> •••,*«,«!, • • •, a P1 )c?x,da i s 0. 

By the theorem of the mean this may be written 


(108) 

(109) 


L /,...A*,Aa { 5; 0, (110) 

i 

where is evaluated at an intermediate point. If we consider 
movements of the £th parameter alone, others being held constant, 
this becomes 

f^Ax^at SO. (k - 1. • • n) (111) 

Hence, if our criterion is definite in sign everywhere, e.g., 

<\), (fc-1, ••-,«) (112) 

then, 

Ax»Aa* ^ 0. (& = 1, •••,») (118) 


This is proof that multiplicity of equilibrium values does not 
alter the definiteness of our conclusions with respect to finite 
changes. It also shows that the criterion which we applied in 
earlier sections to determine the definiteness of instantaneous rates 
of change can be generalized to determine the definiteness of finite 
changes. In fact, by a proper limiting process our previous 
theorems relating to instantaneous rates of change can be deduced 
as special cases from this more general analysis. 

The equality sign can be dropped if A a ^ 0, and if the maxi- 
mum is proper. It will be noted that in this proof we did not 
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require that 6 be continuous, but merely that the B { be continuous 
with the derivatives of the required order. This will turn out to 
be of considerable importance in the later consideration of discon- 
tinuities in the production function of a firm. 

A very important case is that in which 2 takes the form 

z = 6(x u ■•*,*») - £ aiiXi. ( 114 ) 

Then 

e(x') - £ a, 1 *,- 1 ^ 0(35°) - £ aW, (115) 

1 1 

6(x°) - £ cti°Xi° S 6(x l ) - £ a iW. (116) 

1 1 

Adding and collecting terms, we get 

£ (a, 1 - a/>)(*i l - x») s 0, (117) 

1 

or 

£ Aa AXi S 0. (118) 

1 

As illustration of the direct application of our methods to finite 
changes, let us revert back to the firm with a given total cost func- 
tion producing in a purely competitive market. For a preassigned 
price, p°, the firm will supply a given output, x° (not necessarily 
uniquely defined, since the firm may be indifferent in its selection 
among two or more outputs). Since profit is at a maximum for 
this output 

[>V - C(x 5 )] - OV - CM] s 0, (119) 

where x 1 may be any output, in particular that output appropriate 
to a second price, p 1 , such that 

[p l x° - C(x 0 )] - iP'x 1 - C{x 1 )'] == 0. (120) 

Adding we get 

ApAx = ( p l - p*){x l - *°) = 0, ( 121 ) 

ApAx > 0 for Ax ^ 0, Ap ^ 0. 


and 


9 ?) 
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For this analysis it is not necessary that the cost curve be con- 
tinuous, or be such as to yield a unique optimum output. The 
marginal cost curve may be undefined at points, have kinks, and 
turn up and down many times. 

Analytic Functions 

In cases where z is an analytic function of the *'s and the cl s, 
finite changes can be evaluated by an infinite power series in the 
Aa’s, whose coefficients depend upon partial derivatives of all 
orders of z at the original position of equilibrium That is, 

(* = 1, .«) (123) 

The geheral coefficient is of the form 


/ d*x, y 
\da i m » ■ • • da n m ' J ' 


These coefficients can be computed from the equilibrium equations 
by differentiating as many times as is necessary with respect to 
the a’s. If the Hessian is not zero, this will yield recursion rela- 
tions sufficient to get the desired higher derivatives in terms of the 
already computed lower derivatives and the higher partial deriva- 
tives with respect to the x's. 

Convertibility into a Maximum Problem 
Earlier in this chapter it was indicated that some problems 
which do not appear to involve extremum positions can at times 
be converted into an equivalent maximum or minimum problem. 
The advantage to be derived from so doing is merely notational, 
since it will require fully as much knowledge to ascertain whether 
the conditions of a maximum position are met as would be neces- 
sary to answer any questions which might be asked. F urthermore, 
there is the danger that unwarranted teleological and normative 
welfare significance will be attributed to a position of equilibrium 
"< defined. To avoid misunderstanding it is well to emphasize that 
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the conversion of a problem whose economic context does not 
suggest any human, purposive, maximizing behavior into a maxi- 
mum problem is to be regarded as merely a technical device for the 
purpose of quickly developing the properties of that equilibrium 
position. 

Our problem may be stated in the following way. Given initial 
conditions of equilibrium 

f (%li ') X n , ", <^m) = 0, (i — 1, • • *, n ) (125) 

such that our unknowns are determined in function of the given 
parameters, namely, 

Xi — g { {a 1, • • -, a m ), (i = 1, • • •, n) (126) 

under what conditions can the set (125) be regarded as the solution 
of an extremum problem so that the n indicated equilibrium loci 
correspond to the vanishing of the partial derivatives of some func- 
tion? That is, does there exist a function 

2 = fix 1 , • • •, x n , a h • ■ - , a m ) (127) 

such that 


fxfx 1 , ■ • Xn, ax, ■ ■ ■, a r „) =0, (i = l, ■■■, n) (128) 
represents the same locus for each value of i as 

f(x 1 , • • •, x„, «i, • • - , <Xm) = 0? (i = 1, ■■■, n) (129) 

Now the same implicit function may be represented in an in- 
finity of ways without changing the locus it denotes. 15 It is de- 
sirable, therefore, that we represent the loci which are our condi- 
tions of equilibrium in a definite unambiguous form. One such 
way is provided by solving explicitly for each variable in turn, 

Xi = M'(xi, ■ ■ - , Xi- 1 , x ;+ i, • - X n ). (i = l, ■■■, 1 l) (130) 

What conditions are necessary on these functions or on the func- 
tions f so that there exists a z as defined by equations (127) 
and (128)? 

Define 

- - - (fit - - ey ■ w >”» 

“ For example, (/*) 2 = 0 , (i = !,•••,») or F(/ ; ) = °- where F( - 0) = 0; F(o) ^ °’ 


a ^ 0. 
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The X’s are unambiguously determinable independently of the 
representation of the /s. In general, 


X./ 7* X,,. 

If there exists a function z, the vanishing of whose partial deriva- 
tives is equivalent to the equations (125), then 



Also for every possible triplet 


(132) 




/u %°/*. 0 /„°/W 


X; |X*;X ,t 


(133) 


because /„ — /,, for all possible pairs 

These conditions are necessary They are not identities for 
all values of ( x \ , , x*), but only for those satisfying 


/'(*!.• ,*») = 0, (1 = 1, ,n) 


i.e., they hold only at the point (xi°, , x„°) 18 They are not to 

be confused with the integrability conditions of chapter v Though 
necessary, they are probably not sufficient Quite possibly similar 
point reciprocity relations can be found between various derivatives 
of higher order If all of these should hold, perhaps a complete set 
of sufficient conditions would be attained 

The relationships given inequation (133) are vacuous if the num- 
ber of variables is less than three All together n(n — 1)(» — 2 )/6 
independent conditions are involved 17 

14 In addition, for all values of (x) satisfying a subset of r(< «) equations /* = 0, 
there will be defined similar relations which are identities in the remaining (n — r) 
variables 

11 Any completely non -specialized set of n equations inn variables, / (*j, ,x») =0, 
can be regarded as equivalent to a stationary position of a function of In variables 
Let 

F(x„ , X.. Xm h. , *.+») = I! /*(t i, , *.)«.♦< 

dF = 0 implies among other things that /*(xi, , x.) = 0 This is definitely not an 

extremum position as shown by reference to the secondary conditions The fact that 
m a larger set of variables a stationary value corresponds to a non specialized system 
seems devoid of economic significance See G D Birkhoff, Dynamical Systems (New 
York 1927), pp 33-34 
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As we have seen, it is not enough that our equilibrium equations 
be expressible as the partial derivatives of some function. In order 
to be able to derive definite theorems it is desirable that this func- 
tion be at a regular maximum or minimum. This requires as is 
shown in Mathematical Appendix A, Section II, that certain 
quadratic forms be either negative or positive definite. Thus, for 
a maximum 


Z Z fijhihj < 0, 

i i 


not all hi = 0. 


(134) 


As shown in Appendix A, Section III, this requires that the follow- 
ing inequalities hold. 


l/ii I < 0; 


/n 

/21 


/12 


> 0 ; 


/11 /12 

/ 2 I /22 

/31 /a 2 


/l3 

/23 

fa3 


< 0 ; 


etc. (135) 


For a regular minimum these determinants are all positive. It can 
be easily shown that either of these is equivalent to the following 
conditions. 


1 A,-; 

A ji 1 


1 

A»7 

Aft 

>0; 

A a 

1 

A jk 


Afci 

A hj 

1 


> 0 ; 


etc. 


(136) 


for i, j, and k all unequal. This proves that it is not important 
that our problem be either a maximum or minimum one, but only 
that it be one or the other. 


As an example of a problem which can artificially be converted 
into an equivalent maximum problem, consider a number of inde- 
pendent firms buying the same kinds of productive services in 
purely competitive markets. 18 The demand of any firm for the 
factors of production can be written 

Vi = r^pi, • • •, p n ), (i~\,---,n) (137) 

where (i»i, • • ■ ,v n ) represent respective amounts of n factors of pro- 
duction, and (pi, ■ ■ p n ) their respective prices. It can be shown 

18 H. Hotelling, "Edgeworth’s Taxation Paradox and the Nature of Demand and 
Supply Functions," Journal of Political Economy, XL (1932), 577-616. L. Court, ‘ In- 
variable Classical Stability of Entrepreneurial Demand and Supply Functions,” Quarterly 
Journal of Economics, LVI (1941), 134-144. R. Roy, De I’UlilitS, Contribution A la Thforie 
des Choix (Paris: 1942). 
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that 

dv , __ dt)j 

dpj 3/>,- (138) 

Where we sum over the individuals and use capital letters for the 
total amounts demanded 

V,~ D ». = XHPu • ' Pn), (i = 1, •••,«) . (139) 


From (13B) 

aR‘ 3/?' 

“ */>, 

There exists, therefore, a function 


(140) 


2 = •••,/>.) ~ (Vtpi + + • • * + V.*.). (141) 

where 

= (<-I, •••,*) (142) 

The conditions of equilibrium represented by the general demand 
functions of (139) are, therefore, equivalent to those derived from 
the condition that Z be at a maximum with respect to the p's, i.e., 


9Z_ 

dpi 


= R> _ V, s 0. 


(t = !,■••»«) 


(143) 



CHAPTER IV 


A COMPREHENSIVE RESTATEMENT OF THE 
THEORY OF COST AND PRODUCTION 

Economic theory as taught in the textbooks has often tended to 
become segmentalized into loosely integrated compartments, such 
as production, value, and distribution. There are, no doubt, peda- 
gogical advantages in such a treatment, and yet something of the 
essential unity and interdependence of economic forces is lost in so 
doing. A case in point is the conventional assuming of a cost 
curve for each firm and the working out of its optimum output with 
respect to its demand conditions. Only later is the problem of the 
purchase of factors of production by the firm investigated, and 
often its connection with the previous process is not clearly 
brought out. 

I should like here to investigate from the point of view of the 
previous chapters the cost curves of the firm as usually presented 
and the production function embodying technical relations between 
inputs and output which lie behind it, and also to show clearly its 
relevance to the problem of the determination of the optimum 
output. In particular, I shall attempt to derive all possible opera- 
tionally meaningful theorems. Much of what is said will be found 
to apply regardless of the elasticity of the demand curve for the 
firm; i.e., under “impure” as well as “pure” competition. By em- 
ploying a suitable notation it is possible from a purely technical 
point of view to consider the case of any number of productive 
factors as easily as one or two. 

Statement of Problems 

In the beginning the revenue side of the firm is completely 
ignored. We assume as given by technical considerations the 
maximum amount of output, x, which can be produced from any 
given set of inputs (v u ■ ■ ■ , v„). This catalogue of possibilities is 
the production function and may be written 

x = <p{v i, • • •, v „ ). 
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In general there will be a maximum output for each set of inputs 
and so this function is single valued and will be assumed initially 
to have continuous partial derivatives of desired order Further 
more no marginal physical productivities can be negative else 
output would not be maximal since it could be improved for the 
same set of factors by leaving some idle Regardless of this con 
sideration in the relevant region under consideration we will have 

S 0 (t = 1 n) (2) 

where as a convention of notation 

<pt “ = marginal physical (degree of) productivity Simi 

larly, we define 

_ d'x 
* ' dv dv, 

It is also assumed that each firm is small relative to the market 
for each input so that unlimited amounts of each can be bought 
at the respective prices (zo t w n ) 

As a matter of definition the total cost of the firm may be 
written as the sum of the costs for each input item and any other 
costs which are independent of the purchase of the designated 
inputs and output t e 

C = A + t (3) 

where A represents costs which do not vary with the designated 
inputs and output (taxes etc ) Of course these fixed charges 
may be zero 

A brief survey of our field by economic intuition will aid in 
phrasing the problems to be inv estigated after which mathematical 
analysis may be employed in stating the conditions imposed on our 
various functions Our aim is to derive the total cost for each 
output More precisely, with given prices of productive factors 
and given production function we are interested in deriving the 
minimum total cost for each output This will be a function as 
follows 

C = A + V(x wi w.) ( 4 ) 

_ If we regard the prices of the productive factors as constant the 
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resulting relationship between C and x is the usual total cost curve, 
from which average and marginal curves can be derived. 

The question arises as to the relationship between (3) and (4). 
It is clear that the same output can be produced by an infinity of 
combinations of productive factors so that in the absence of other 
considerations it is impossible to determine the total cost uniquely 
for each output. In view of the consideration that total cost for 
each output be a minimum, our indeterminacy disappears. Out 
of the set of all possible input combinations which will produce a 
given output, that combination is selected which minimizes the 
total cost defined in (3). In other words, with given prices of pro- 
ductive factors and for a preassigned output, there is an optimum 
value for each productive factor; i.e., 

Vi = f(x, ®i, (i = 1, - • •, n) (5) 

By substitution into (3), we have 
« 

C = A + £ Wif(x, wi, • • •, w„) = A + V(x, wi, ■■■, w n ). (6) 
1 

Thus, the relationship between (3) and (4) is revealed. 

It is part of the purpose of this book to investigate the proper- 
ties of the functions (4) and (5). It is true that theoretical eco- 
nomics does not deal with particular forms of functions (e.g,, 
polynomials, etc.). However, it does concern itself with the 
general character of various functions; that is, their slopes, curva- 
ture, etc. In this case we are interested in the following properties 
of these functions: 

dc dc_ e 2 c m 

dx ’ dx z ’ dWj ’ dxdWj ’ 

i.e., how are total and marginal costs affected by changes in output 
or prices of productive services? On what properties of the pro- 
duction function does this depend? 

We are also interested in 


dVj dvj dvi _ 

dwi ’ dWj ’ dx ’ 

i.e., what is the reaction of demand for a productive service with 
a change in its own price? With a change in another price? With 
a change in output? 
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These are obviously important theoretical questions, and yet, 
curiously enough, the answers to some of them do not appear to be 
in the literature. In the next section these will be mathematically 
evaluated, and the results summarized. It will be found that 
under very general assumptions the knowledge of the signs of these 
differential quotients will give us the direction of change with 
respect not only to sufficiently small finite movements, but also to 
finite movements of any size. 

Conditions of Equilibrium 

Thus far we have employed only the notation of mathematics. 
By so doing, the problem has been clearly framed, which in many 
economic problems is more than half the battle. It remains now 
to state the derivations of (4) and (5) from (1) and (3). 

Our problem is to minimize 

C = A + £ w,v, (9) 

subject to 

<p(v u •••,?„) = x - constant. (10) 

Mathematically, this is a constrained minimum problem, and we 
may avail ourselves of the method of the Lagrangean (undeter- 
mined) multiplier. We define a new function 

G = A 4- T. ~ • • •, »«) — O- 

i 

where (— X) is a Lagrangean multiplier, whose economic interpre- 
tation will be brought out later. G may be regarded as a function 
of all the inputs as independent variables. It is necessary for a 
proper relative minimum that 

~ = u — — Vw»- f <* — * ' ' » 

dv. 

These may be rewritten as 

J. _ vi _ (13) 

X Wi W t Wn 

This is the well-known economic theorem that in order for total costs 
to be a minimum for any given output, the marginal productivity of 
the last dollar (i/X) must be equal in every use. Alternatively, it may 
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be stated that the marginal physical productivity of any factor 
must be proportional to the price at which it can be hired, the 
factor of proportionality being the term X. 1 It will be noted that 
this condition is independent of the revenue curve of the firm and must 
hold at every point on the cost curve , not only at the final point of 
optimal output. 

Provided that certain secondary conditions to be discussed 
presently hold, the n equations in (12) and the one equation in 
(10) are sufficient to determine each of our (« + 1) unknowns 
(vi, • • • , v n , X) in terms of x and the factor prices (w i, • • - , w n ) re- 
garded as parameters. Thus, our equations (5) are defined im- 
plicitly by these minimum conditions. 

One may well question the advantage of this formulation. Ap- 
parently we have substituted an indirect statement of our final 
conditions for a direct one. But, and this is typical oft all sound 
economic theory, we are trying to deduce the consequences of our 
hypothesized data, and we do know (by hypothesis) much about 
the functions in (12). Merely,to state the relations in (4) and (5) 
is formal and empty. By means of (12) we can place positive 
restrictions upon them and know their general properties. 

Now there remains the purely mathematical problem of trans- 
lating our assumptions into terms of the functions (4) and (5), and 
computing their respective partial derivatives designated in (7) 
and (8). 

Secondary Extremum Conditions 

To do so we must first state all that we know about the condi- 
tions defined in (12), in particular the secondary necessary and 
sufficient conditions for a proper relative constrained minimum. 
It is clear that in order for total cost to be a minimum for a pre- 
assigned x, the locus of all possible inputs yielding that preassigned 
quantity (isoquant surface) must be tangential to a locus of all 
possible inputs yielding the same total cost (isocost plane). But 
of course this is not enough. The isoquants must also be convex 
to the origin in all directions in order that its contact with the 
isocost plane represent a true proper minimum. The analogy with 
the theory of consumer’s preference suggests itself. This is brought 
out more clearly if we phrase the problem not as that of minimizing 

1 Later X will be shown to be equal to marginal cost, i.e., 3C/dx. 
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total cost for a preassigned output, but rather in the equivalent 
form of maximizing output for any preassigned total expenditure 
Mathematically, our secondary conditions are 


£ £ < 0 , 

for * 


and not all 


£ *>,£. = 0 , 

i 

f. = 0 


Consider the bordered determinant 


‘-I 


(14) 


S' 2— 


Vn V\2 
Vi i 



Vn 1 Vn2 
Vl V2 


and respective principal minors 



Vn 

Vn 

Vi 


D n = 

Vn 

V22 

V2 

, £> ,?3 = 


Vl 

Vi 

0 



Vn 

V22 

Vsi 

V2 


Vln Vl\ 
Vtn Vl j 


Van Vn 
Vn 0 


<?13 Vl 
•Pi 3 <P2 

VS3 VS 
Vs 0 I 


etc 


(IS) 


(16) 


It is well known 5 that (14) implies that any such minor of the 
order m must have the sign of (— l)" -1 , and conversely, i e , 

(- i)—i D i 2 c«-D >o (w < n + 1) (17) 

Specifically, 


VhV* ~ ZVuViVt + VsiVx < 0, etc (* j) 

It will be noted that this condition does not necessarily imply or 
require the law of diminishing marginal physical productivity 
to hold 3 


* Compare H Hotelling “Demand Functions with Limited Budgets," Economelrtca, 
January, 1935, pp 66-78 

* J am assuming that these secondary conditions hold not only at the mwumuta 
point, but everywhere Mathematical!) , this assures us of the uniqueness of our cqui 

1 i, since this stronger assumption -deiinicety-rales oat multiple relative minima 
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The secondary conditions are not always mentioned in the 
literature. It is not for elegance or completeness that they are 
included here, but rather because they are wholly relevant to the 
problem under discussion, since it is upon them that all our results 
depend. 


Displacement of Equilibrium 

It is now possible to derive in compact form the rates of change 
of our dependent variables (v h •-•,»„) with respect to changes in 
the variables (x, w i} • • • , w n ) . The reader not interested in the 
mathematical derivation of these conditions is referred to a sum- 
mary of results at the end of this section. First we write the total 
differential of our equilibrium equations (12) and (10). 

£ <Piidvj + f d\ = ^ , ft = 1 ,•••,») (18) 


n 

Y. (fjdvj = dx. 
1 


These are (n + 1) linear equations in (n + 1) unknowns 
(dv 1 , • • •, dv„, d\), and may be solved in determinantal notation as 
follows: 


where 


A a + dx A„ + i, t 


A = 


rr* 

II 

i 


A 

i 






Pi 


pn 

Pl2 

Pm 

X 





P2 



P 21 

P 22 

Pin 

X 

<pn\ Pi 






Pi X 






0 


. 

. 

. 

. 





Pn 


Pnl 

PnZ 

’ ^Pnn 

X 


Pi 

<P2 

• • • Pn 

0 


(19) 


( 20 ) 


and A, r is the cofactor of the element in the rth column and the 
gth row. 
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Also, 


d\ 

” dw x 

T X A, ‘+ I +H+u+i 

(21) 

A 

Hence, 

tok _ A, k 
dw, XA 

(22) 

As a special case 

dVk _ A kk 
dw k XA 

(23) 

Also, 

dv t A n4 .i k 
dx 53 A 

(24) 

and 

aX __ Ak n-M 
dw k XA ’ 

(25) 


ax A n+1 B+v 

dx A 

(26) 


It is clear on inspection of the determinant A, that 

(27) 

Also 


A,k = ^ D )k — Akf (j, k — 1 , 

,n) 

(28) 

A/ n+l — D) n+l = XA n+ i , 0 = 1, 

. «) 

(29) 

and 



A„+\ «+l = D n+l n+l 


(30) 

Therefore, from (28) 



dVk dv, 

dw, ~ dWk 


(31) 


That is to say, the change in the kth factor with respect to a change 
tn the jth prtce , output being constant , must be equal to the change tn 
the jth factor with respect to the kth price, output being constant, a 
result which is not intuitively obvious 
From (27) and (28) 

Ajk _ D,k 

A ~ D 


( 32 ) 
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But from our stability conditions in (17) 


< 0- (J = l. • ••» ») 

Therefore, 

(33) 


(34) 

That is, any fixed output will always be produced with less of any 
given factor as its respective price rises , other prices not changing. 

By the law of the mean this can be shown to hold for finite changes. 

Let us now determine the economic meaning of X. Rewrit- 
ing (11) 

n 

G = A + Z WNi — XO(j/i, •••,»„) — x], 

1 


and differentiating G, which is total cost with a term added on, 
we get 

^-x 

dx 

This suggests that X may be marginal cost. This can 
rigorously in two ways. Of course, 

be proved 

n 

dC = Z wfivu 

1 

(35) 

and 


n 

dx = Z pidvi. 

1 

(36) 

Dividing (35) by (36), we get 


ac ? 

£ <Pidvi 

1 

(37) 

Substituting from (12), 


n 

Z R<pidVi 

~ = = X(.%*, Wi, ■ • • , W»). 

dx Z <Pidvi 

1 

r 
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More rigorously the proof is as follows 


DC ■ a», 

«T = 


Substituting from (24), 


= f „ -\y„ A„ + , . 

But expanding A in terms of the elements of the last row, 

A - i vA*+i • 

i 

Hence, 


BC 

dx 


Xy = X 
A 


Therefore, we may rewnte (12) as follows 
dC 


(39) 

(40) 

(41) 

(42) 

(43) 


Thus, it may be stated as a theorem that in order for total costs to 
be a minimum for any given output, the price of each factor must be 
equal to marginal physical productivity limes marginal cost 4 This 
holds regardless of revenue considerations 
Of course, 


and 


From (24), (25), and (29), 


ax _ 

d'C 

(44) 

dx 

: dx ' ' 

ax 

d*C 

(45) 

ii 

II 

dxdWk 

ax 

dWt 

dv k 
dx ’ 

(46) 

a*c 

dvt 

(47) 

dxdWk 

~ dx 


« This has been pointed out in another connection m lectures by Professor Viner 
with illuminating insight into the relationship of the external to the internal margin and 
the broad zone of indifference as a substitute for the infinitesimal Paradoxically, it is 
this condition which is basic to Rlr Wong s famous envelope theorem 1 
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That is, the change in any input item with respect to an increase 
in output must be equal to the change in marginal cost with respect 
to a change in the price of that input item. 

Recall from (26) 

ax d 2 C A n+1 , n+ , 

dx dx 2 ~ A ' ^8) 


Now it is known from (17) and (27) that A has the sign of ( — 1)". 
Also 


<Pll • • • (pin 


A 


n+l»ra-M — 


= H, 


<Pn 1 - ‘ ‘ (pnn 


(49) 


where II is called the Hessian determinant of the production func- 
tion. Obviously, therefore, the slope oj the marginal cost curve must 
have the same or opposite sign as compared to this Hessian, depending 
upon whether the number of inputs is even or odd; i.e., 

(SO) 


Thus the stability of pure competition is intimately tied up with 
the Hessian of the production function, a result not intuitively 
obvious. 

I should also like to indicate certain other results, leaving their 
rigorous derivation to the interested reader. Consider equation (5) , 

Vi = /*’(#> Wl, ■ ■ ■ , Wn). (i — 1. •••>») 

These functions are defined by (10) and (12), 


and 


x = <p(v u • • • , v n ) 

Wi — \(fi = 0. (7 = 1, • • •, n ) 


(12) may be rewritten as follows: 


— = — . (7 = 2, •••,«) (51) 

<p i Wi 

Obviouslv, the changing of all prices in the same proportion will 
leave the solution of (51) unchanged; hence, the function 
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must be homogeneous of order zero in the variables (w,, . . . , W|i ) t 
x being constant; i.e., 

V, = /'(*. »1, • • • , Wn) *= /*(*, TWl, • ■ • , yw „), (* = 1, . . . , ft) (52) 

where y is arbitrary. Hence, from Euler's theorem on homogene- 
ous functions, 

0 = C- 1 ’ (53) 


This can be verified by substitution from (22). 
By analogous reasoning, 

r a . «£ 

C = yl-f E if. , 
i dzu, 

where 


OC 

a ~ IV 

diPj 


( 54 ) 

(55) 


Actually, from the considerations of the previous chapter it is 
possible to deduce a more general condition which includes (34) as 
one minor part. The minimization of total expenditure for a given 
output, price being fixed, implies (10) and (12). In addition, at a 
regular minimum we must have 5 




( 56 ) 


for not all £, proportional to w„ Consider the determinant 



r ' ' ' 
a(wt, • • ■ 


and 

I dVi dvi I 


„ a(»i. c,) 

_ [ dwi dwt 1 

d(vi,Vi,v a ) . 

12 d(w lt wt ) 

~~ 1 dv 2 dv 2 | 

’ m d(w u w 2l W 3 ) ’ 


\dwi dw t\ 



Then each such principal minor of order m (less than n) must be 
negative or positive depending upon whether m is odd or even; i.e„ 


(- 1 > 0. = G = 0. (59) 


1 Compare chap m, p 30, Mathematical Appendix A, Sections IV and V, and chap v, 
pp 113-116. 
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Specifically, 


dVj 

dw 


L n . dvj_dv k ( dVj V 

9 ’ >0; etc - 


(60) 


It may be well to summarize our results of this section: 

dVj 


dWj 


< 0 ; 


d (vj, Vk) 

diwj, Wk ) > °: etc - (34) and (59) 


dv k 

dVj 

II 

l<"0 

dw k ' 

d 2 C 

dVj 

dxdWj 

~ dx ’ 


. dC 
Wi = X<Pi = Vi ’ 


(-i)^H <0, 


(31) 
(47) 

(« = 1, •••,») (12) and (42) 

(50) 


v- dV{ n 

X, T Wy = 0, 

1 dWy 7 

dC 


(*=!,•••,») 


aw,- ” 

~ „ , " ac 

C = i4 + E Wi r 

1 5^,- 


(53) 
(55) 

(54) 


Boundary or Corner Minima 

Even in the case where the production function and its deriva- 
tives are continuous with the proper convexity to insure a uniquely 
determined optimal position, the interesting case may arise where 
some factor may not be used at all. That is to say, the more that 
others are used and the less of it, the lower will costs be for any 
given output. In this case the conditions for equilibrium do not 
require the equalization of the marginal productivity of the last 
dollar spent on it to that of other factors. Rather do we have a 
boundary minimum due to the fact that no negative values are 
economically admissible. Hence, the conditions of equilibrium 
are given by the statement that for any input, potentially usable 
but not actually used, the marginal productivity of the last dollar 



70 


FOUNDATIONS OF ECONOMIC ANALYSIS 

spent on it must not be greater than the marginal productivity of 
the last dollar spent on factors which are used 6 
Mathematically, 

<Pu 1 

w u = X ’ (61) 

where the uth factor is one not actually used. 

< As price w u changes, it may still remain unused until some 
critical level at which it will begin to be used , and hence will fall 
into the analysis of the previous section. Of course, the critical 
level may very well depend on the scale of operations; i e , output, 
so that with the same price it may still be brought into use by an 
increase in output 1 

The demand function for such a factor of production will have 
the following properties. 

v« = f u (x, w u , w t , , w n ), (62) 

— 0 in some domain defined by w u , w lt • • , w n ) < 0, (63) 

dv u 

0 ^- < 0 in some domain defined by yf/{x, w u , w ir , w„) > 0, (64) 

where $ is so constructed as to form the locus of all critical points 
described above. 

Discontinuities in the Production Function 
I should now like to drop the assumption that the production 
function is necessarily continuous with continuous partial deriva- 
tives at every point This assumption has been challenged by 
many economists, who have alleged that production coefficients 

■ There is an interesting discussion of an exactly analogous equilibrium system in 
the famous paper of J Willard Gibbs , 1 The Equilibrium of Heterogeneous Substances 
Collected Papers, I, 55-349 ( 

1 An analogy is suggested to the case of items which do not enter into a consumer s 
budget until income increases or their relative prices decrease to some critical levels 
Be it noted that this phenomenon here described can occur even though there be m 
creasing marginal physical productivity, just as the budget case does not restrict the 
behavior of marginal utility It can be mathematically proved that this result is inde- 
pendent of the cardinal measure of product (utility) Another analogy is provided by 
the classical comparative cost doctrine— according to which a country specializes com 
ptetely in one good, the equilibrium is defined by a certain inequality between prices 
and marginal costs 
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are technically fixed, that some factors are “limitational,” some 
factors of production “must” be used in certain joint proportions, 
etc. These discontinuities, if true of the real world, have been 
thought by many economists to offer serious problems to the analy- 
sis of distribution and the pricing of the factors of production. 

It will be argued here that the fact of discontinuity offers no 
problems to the firm — on the contrary, its task is made much 
easier. As an obiter dictum I hold that it also offers no particular 
difficulty to the analysis of the wider problem of determining the 
prices of the factors of production with which each firm is to be 
confronted. As before, these are to be determined by general 
equilibrium analysis of supply and demand. 8 

As before, we have a production function relating maximum 
output to any given set of inputs: 

x = <p(v u • • •, v n ). (65) 

Precisely as argued in the first section this must be single-valued. 
However, it need not be continuous nor have partial derivatives 
at every point. In order that there be no contradiction to our 
definition of output as maximal, we must have 

A (p g 0, for A Vi ^ 0. (i = 1, • • •» n) (66) 

That is, as we increase all factors together, output cannot decrease, 
since otherwise product would not in the next position be maximal. 

For definiteness I assume that along an isoquant the production 
function contains only a finite number of points which do not 
possess continuous partial derivatives. At a point of discontinuity 
it is assumed that left-handed and right-handed partial derivatives 
exist. Of course, at a point of discontinuity there does not exist 
a uniquely defined plane tangent to the isoquant, but limiting 
direction cosines can be found for all planes which touch but do 
not cut the isoquant surface. It is also assumed that the isoquants 
are of a “single concavity,” to be defined later. The production 
function so defined is general enough to include the case of fixed 
coefficients of production, “perfectly complementary factors, 
limitational factors, etc. 

8 It is possible that within a narrow range the price may be indeterm •. 
special cases to coincidental inelasticity of supply and demand. 
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Note that all the inputs are to be regarded as independent 
variables. It is never true that they must be used in fixed propor- 
tions. It is true that it may be unprofitable not to do so, but this 
is a result of economic calculation. Even in the continuous case, 
given certain economic data, the factors must (on consideration of 
profitability) be used finally in given proportions, and in deter- 
mined amounts for each output. The only difference between 
these cases is that in the discontinuous case the required optimum 
point may be more obvious and less sensitive to changes in the 
prices of all factors of production. Be it understood that I do not 
minimize discontinuity. On the contrary, I should like to indicate 
the outline of a method which will handle both cases. 

Along an isoquant it is assumed that we have convexity as de- 
fined below. Consider any point on an isoquant, (vi 1 , ■ • r, 1 ). 
There must exist constants, (an 1 , • •, (not necessarily unique) 
such that 

E «,‘(t», 2 - r, 1 ) £ 0, (67) 

i 

where (»i*, • • • , r* 2 ) is any other point along the same isoquant. 
This merely says that there exist one or more tangent planes at 
each point which touch but never cross the isoquant. Similarly, 
at the second point there exist constants, (ai 2 , • • • , «n 2 ), such that 

t a?{v t x - v?) £ 0 , ( 68 ) 


- £ <*.*(■>.' - ».') a o. (69) 

l 

Adding (69) and (67) and changing sign, we get 

i.w-i.jyw-i!' 1 , -£1. <?°> 

i 

Since these are two arbitrary points, taking one as fixed, say 
(vi°, ■ • Vn), we must have along an isoquant 

E a<°A», S 0 ( 71 > 


for 


<p(y l ®, • • •, Vn°) = <p(vi° + A»i, • ", V n ° + Av n ), 



and 
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S AciiAvi s o. (72) 

Actually, it can be verified that the necessary and sufficient 
conditions imposed on (ai°, • • - , a „°) in order that (71) holds are 
the following inequalities : 


< Pi 8 < ^ 

<Pi L ~ Oil 0 = <pi s ' 


(73) 


where <p, L and <pi 8 are respectively the largest and smallest of the 
left- or right-hand derivatives at the given point. At any point 
where there is no discontinuity, the right-hand and the left-hand 
derivatives are identical; hence, the inequality converges to the 
equality, 

O^' 0 ViiP 1°| * > fy>°) rtjt \ 

«X° " y } 

All of the above are merely elaborations of the definition of con- 
cavity. It remains to show its relevance to the problem at hand. 


Conditions of Equilibrium 


Suppose we are given a set of prices (wi°, • • • , w„°) correspond- 
ing to which there is one combination of factors ( vi ° , • • • , v n °) which 
minimizes total cost for given output. As a definition of our 
minimum 


AC s 0 


(75) 


for 


Ax = 0, A Vi § 0. 


In other words, for any point (»j, — , v„) along the same isoquant 
we must have 


for 


£ w?Vi s £ Wi%° ( 76 ) 

1 1 

<p(vi, = <p{v 1 °, • • Vn), 


£ w°Avi § 0 


or 
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for 

At>j , •••,»„“ + Ar„) =3 tp(v?, ■ « tf* 0 ). 
Obviously from (73) this impties 


<p t s ^ w t ° <p t l 

tpi L ~ wf ~ <pi s ' ( 78 ) 

In the case that the maximum point is one at which continuous 
derivatives exist, we have the conditions of the previous section, 


jg. _ w,° 

V>i Wi u ’ 


(79) 


Thus, condition (78) is the general condition which includes (79) 
as a special case. 

Furthermore, it is still possible to place definite restrictions on 
the demand functions for the factors of production, output being 
constant. As before, 


v.° = /•(*, w,°, ••.ID, 0 ), (80) 

or 

At», = j\x, Wi° -f- Aw lf •, *«-° ■+■ A iv H ) — »,°. (81) 

From (72) we have 


for 


£ 

i 


Aw,Av t 


0 


(82) 


Ax = 0. 


Suppose that only the price of the &th factor varies. 
(82) becomes 

AwkAvk = 0 . 


Then 

(83) 


That is, an increase in the price of a factor cannot result in an 
increase in its use. Likewise, a decrease in its price cannot result 
in a decrease in its use. Still more generally, it can be stated that 
a change in the price of any number of factors cannot result in a 
change in amounts of all the factors in the same direction, i.e., 


£ AwjAVj SO, (r£ n) (84) 

for all Aw, S 0, not all Av, > 0; likewise, all Aw, S 0 implies not 
all AVj < 0. 
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Determinateness of Equilibrium 

Clearly our minimum cost is unambiguously determined even 
in the case of discontinuity. The task of the firm is easier because 
the penalty for not being at the minimum point is greater and more 
obvious. To be at the top of a smooth hill requires fine balancing 
and judgment. To find a maximum which is a cusp or spire is 
much easier. Moreover, such an equilibrium is extremely stable. 
It is sometimes called “too stable’’ equilibrium. In order to move 
it a very large change in prices may be necessary. In the limiting 
case where the increase of each factor in more than the optimum 
proportion results in no increase in product, the wage may fluctuate 
from zero to infinity without changing the proportions of factors 
employed for each output. And yet the problem is completely 
determined. 

It is curious to see the logical confusion into which many 
economists have fallen. The primary end of economic analysis is 
to explain a position of minimum (or maximum) where it does not 
pay to make a finite movement in any direction. Now in the case 
that all functions are continuous, it is possible as a means towards 
this end to state certain equalities on differential coefficients which 
will (together with appropriate secondary conditions) insure that 
certain inequalities will hold for finite movements. It is no exagger- 
ation to say that infinitesimal analysis was developed with just 
such finite applications in view. Unfortunately, the means have 
become confused with the ends, and so conventions and artifices 
are continually sought in order to be able to make statements con- 
cerning marginal equivalences. A case in point is the Marshallian 
marginal net productivity. It is only in the singular case that the 
production function is differentiable (i.e., possesses certain proper- 
ties of continuity) in a certain direction (i.e., for certain composite 
movements) that it is possible to employ this device; whereas the 
inequalities of condition (78) always give necessary and sufficient 
conditions, and include the marginal net productivity relations 
whenever these happen to be applicable. 9 

3 An example of another improper use of marginal net productivity curves is pro- 
vided by the treatment in Mrs. Robinson’s The Economics of Imperfect Competition 
(London: Macmillan, 1933), chap. xx. The reasoning is mathematically circular; t e 
so-called demand curves shift with changes in the wage of one factor ! This o ows rom 
the fact that they are drawn up as of appropriate amounts of the other factors. I hese 
appropriate amounts are necessarily a function of the prices of all factors o pro u 
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Although equilibrium is determinate for the single firm con 
fronted with prices of the factors of production discontinuities of 
the production function may introduce some difficulties into the 
general equilibrium problem whereby all firms and individuals 
together through the interplay of their demands and supplies de 
termine the prices with which each is confronted For discon 
tinuities may introduce perfect inelasticities of demand in certain 
domains, there emerges the possibility, however remote, of coma 
dental inelasticities leading to indeterminacy of price within certain 
limited ranges But this is outside the scope of the present 
discussion 

Maximization of Profit 

By now we have attained to the point where most discussions 
begin We have seen how to derive the locus of factor combtna 
tions which give lowest total costs for each output But as yet 
the scale of operations level of output, actually to be undertaken 
has not yet been determined This can only be done m the face 
of a new set of considerations, those relating to the terms at which 
different amounts of the commodity can be sold I take as a 
datum the maximum amount of gross total revenue which can be 
secured for each level of output This may be written 

R = R(x ) (85) 

Let us define profit, net revenue, as the difference between gross 
revenue and total expenditure, 

7 r = n(x, wi , w„) — R(x) — A — V(x wu . W") (86) 

Output will be optimum when profit is at a maximum Necessary 
conditions that this be so when all functions are differentiable are 


3ir _ dR 


(87) 

dx 

dx 

dx 

av d 7 R 


(88) 

ax 3 

dx 1 

dx 1 - 


Assuming that we have a regular relative maximum, this becomes 


3 R _dV 
dx dx ' 

(89) 

d'R d*v 

dx* dx 1 

(90) 



THEORY OF COST AND PRODUCTION 


77 

This is the familiar theorem that at the optimal output the 
marginal revenue curve must intersect the marginal cost curve 
from above. 10 

Regardless of whether or not the profit function has derivatives 
at each point our proper maximum conditions are 


At t < 0, 

for 

© 

AV 

<1 

(91) 

or 

A R AC 
Ax < Ax ’ 

for 

Ax > 0, 

(92) 

AR AC 

Ax > Ax ’ 

for 

Ax < 0. 

(93) 

The economic common sense 

of this is 

obvious. 11 



Equation (89) gives us a relation to determine optimum output 
re 0 . By substituting the value of x°, so obtained, in (5) we get a 
new set of demand curves for factors of production, drawn up as 
of a given total revenue curve. 

Vi = f(x°, Wi, • • • , w n ) = g*(w 1 , • • -,w n ). (i = 1, • • • , n) (94) 

It is possible to derive the positions of output, input, etc., in a 
more direct fashion by treating all the inputs as independent 
variables. 

Let 

n 

• • • , v n , w it • • ■ , w n ) — R[<p(v i, • • • , p n )] - A — YL wNi- (95) 
At a proper maximum 

|| <Pi-Wi = 0 , (* = 1, •••,») ( 96 ) 

and 

T = [ \Rx<Pij + Rxx<Pi<Pi3 

must form the coefficients of a negative definite quadratic form. 

10 The famous cost controversy may be interpreted as an argument over the impli 

cations of these conditions. , 

11 In the case of pure competition, when price is independent of sa es, ese co 
tions become 

p < — and P>—, respectively. 

Ax Ax 

It can also be shown that A pAx ^ 0; i.e., that an increase in prices cannot, celeris pan us, 
result in a decrease in quantity supplied. Hence, the firm s supply curve canno 
negatively inclined. 
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The results m (96) follow from (43) and (89) since for any 
output factor price must equal marginal cost times marginal phys 
ical productivity while for optimum output marginal cost equals 
marginal revenue These are n equations from which we can 
solve for the n factors oi production in terms of the n prices to get 
the demand functions of (94) 

*» = £’(™, ,a,) (t = 1, ,n ) 

Actually, it is known that 11 

dv % , r„ 

dw,~ S) ~ T * ( 9? ) 


where T x , is the cofactor of the element of the tth row and the jth 
column of the above matrix From the definiteness of the above 
quadratic form it follows that these last must form the coefficients 
of a negative definite form, i e , 


&<«■ 


g*) 

d[w, w y ) 


> 0 , 


etc 


(98) 


Indeterminacy in Purest Competition? 

If competition is pure” in the commodity and factor markets 
and the production function is homogeneous of the first order then 
it is a classical fact that the matrix T, which except for factors of 
proportionality is by the first assumption identical with the Hessian 
of the production function is singular Therefore, a regular maxi 
mum for the firm is impossible Unit costs being constant and 
demand being horizontal, there are only three possibilities price 
being everywhere greater than marginal cost, it will pay the firm 
to expand indefinitely i e , until competition ceases to be pure or 
if price is less than marginal cost no output will be produced or, 
finally, if price is identically equal to marginal cost the exact 
output of the firm will be a matter of indifference Thus what 
was usually regarded as the most favorable case for pure competi 
tion turns out to yield indeterminate output for each firm 

However, too much should not be made of this paradox Even 
though the output of every firm may be indeterminate their sum 

is Compare H Hotell ng Edgeworth s Taxation Paradox and the Nature of 
Demand and Supply Functions Journal of Political Economy XL 577-616 
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may be determinate, in the same way that the sum of two discon- 
tinuous functions may be continuous. For if many firms expand 
output, others will contract; for price will fall along the industry’s 
demand curve causing contraction. It has been argued, however, 
that competition will disappear since any one firm, with no ob- 
stacles to its expansion, will grow until it is a “significant part of 
the market,” at which time it will be able to affect the price of 
its product. 

This is similar to the familiar argument by which it is shown 
that decreasing marginal cost within a firm will lead to monopoly. 
Nevertheless, the analogy is faulty. For the demand curve of any 
firm is equal to the demand curve of the industry minus the supply 
curve of the remaining firms, already in the industry or potentially 
therein. This being the case, it is easy to show that under uniform 
constant costs the demand curve for a firm is horizontal even 
though it produces 99.9 per cent of all that is sold, Geometrically, 
the long-run supply curve of potential rivals is horizontal, and a 
horizontal curve subtracted laterally from any curve must always 
yield a horizontal curve. Economically if the firm were to begin 
to restrict output so as to gain monopoly profit, it would cease to 
sell 99.9 per cent of the output or even anything at all. Conse- 
quently, it would not attempt to do so, but would find its maximum 
advantage in behaving like a pure competitor. 

Thus, it remains true that the classical assumptions underlying 
pure competition are actually consistent. It is no accident that 
Walras and Marshall paid so little attention to the firm and so 
much to the industry. For under the purest conditions of competi- 
tion the boundaries of the former become vague and ill-defined, and 
also unimportant, since through reactions to prices the factors of 
production adjust themselves in the right proportions and in the 
right total amounts for the industry. 

Perhaps to a greater extent than in the case of increasing cost 
will the industry be subject to oscillation around its equilibrium. 
However, once displaced, output will tend to return to the correct 
value, so that the equilibrium may be said to be stable even though 
it would appear that a limitingly small individual would have no 
incentive to keep his output unchanged. However, the same is 
true of the situation where the price of the same good in two 
markets just balances the transportation costs necessary to transfer 
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a unit of the good from one market to the other. Any one arbi- 
trager has no special inducement to ship more or less than he is 
actually doing, nor even to ship just the amount that he is doing. 
Yet the equilibrium is stable; for if the right amount were not to 
flow, the spatial price differential would change so as to return the 
system to its previous position. To an infinitely near-sighted olive 
the bottom of the cocktail glass appears level, and it no doubt 
regards itself as being in neutral equilibrium. Actually, the equi- 
librium is stable as any finite movement will show. 

Discontinuous Case 

In the general case where the production function is not neces- 
sarily differentiable we have still for a maximum 

Ar 5 0, for Av, § 0. (99) 

As a special case of this, for a movement of one factor, all others 


constant, we must have 



AR Ax _ 

— -r— < W„ 

Ax Av, 

for 

Ar, > 0, A v, = 0, (100) 

ARAx „ 

Ax Av t W " 

for 

Av, < 0, Av } = 0. (101) 

That is, it must pay to move neither backward nor forward. 
Moreover, consider a given set of factor prices ( ivi ■ **, w, # ). 


Corresponding to this there is a set of factors (tfi°, - • f»°) which 

give a maximum profit. In order that this be a real maximum 

ROW, • • ■>.')] - a - £ w.w a .ROW, • ■ f. 1 )] 

1 

- A - Z ik.V. (102) 
1 

Consider a set of prices (wi 1 , • • • , w a l ) for which the (arbitrary) 
point (ui 1 , • • - , vj) is the point of maximum profits. Then 



THEORY OF COST AND PRODUCTION 
Now adding (102) and (103) and canceling terms we get 

£ (w, 1 - w?)(v> - v?) s 0, 

1 

or 

£ AWiAvi s 0. (105) 

For only the jth. price varying, this becomes 

AwjAvj ^ 0. (106) 

In other words, a decrease in a price cannot result in a decrease in 
the factor used. Further possible interpretations are of course 
possible. 

As before, the general case is simpler than that of the special 
continuous case. Moreover, the method of finite increments ap- 
pears to be mathematically simpler in the sense that it is possible 
to state the qualitative direction of changes without solving in- 
versely for the actual demand functions. 

The method employed here is that which underlies Le Chatelier’s 
principle in physics. By making use of Professor E. B. Wilson’s 
suggestion that this is essentially a mathematical theorem ap- 
plicable to economics, it has been possible to gain increased gener- 
ality without increased complexity and emptiness. 

It is important to realize just how much content there is to a 
particular economic theory. As far as the single firm is concerned, 
everything fundamental which can be said is implied in the state- 
ment that in equilibrium there must exist no movement by which 
the firm can improve its profits; i.e., Air = 0 for all movements of 
variables possible to the firm. In the case of continuity certain 
necessary relations of differential coefficients (marginal equiva- 
lences) are implied. Moreover, assuming certain specific forms to 
our functions (independence of prices, etc.), it is possible to deduce 
formally the implications of an equilibrium position (e.g., negative 
demand curves, positive supply curves, etc.). It appears that no 
more than this can be validly stated. 

External Conditions of Equilibrium 

Thus far we have been discussing the conditions of equilibrium 
imposed from within the enterprise by its desire to maximize 


8i 

(104) 
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profits This has resulted in certain marginal inequalities Econ 
omists have not stopped here, but have also tried to analyze certain 
conditions of equilibrium resulting from inter competition among 
firms That is to say, they have tried to state conditions upon 
the market situation (obstacles) with which each firm will be con 
fronted In particular, they have been interested in the determini 
tion of the rate of profit which any firm can earn 

It has often been argued that not only must price (average 
revenue) under "perfect" competition equal marginal cost, but also 
it must be equal to average cost so that net revenue will be zero 
This second condition has not always been recognized as being of 
an entirely different nature from the first In this section an 
attempt will be made to distinguish between them It is hoped 
that in so doing it will be possible to put the famous * adding up” 
problem and homogeneity of the production function in its proper 
place 

In the beginning, to avoid confusion no use is made of the 
term “perfect ’ competition The term "pure” competition will be 
understood to mean that the demand curve for any producer is 
infinitely elastic that his sales cannot affect prices The problem 
of discontinuity is ignored Under these conditions the internal 
conditions of equilibrium are that marginal cost be equal to mar 
gmal revenue (price) and hence that the marginal physical produc 
tivity of each factor times the sales price of the good be equal to 
the price of the respective factor 


dR = = 8C 

dx p dx' 


( 107 ) 


dC * 

W % = -r-<p> = p<px 


( 108 ) 


These are marginal conditions and say nothing about the totals 
wwJrt't'i kt. \fL by defimtmn- of the long, run’ as that 

period in which all costs can be avoided by going out of business, 
that the firm must never have a negative net revenue As a 
condition internally imposed we know that 

xgO, (» 


or 


i?(x) S T, w,Vi 


( 110 ) 
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Some writers, by a curious play on words, have been able to 
arrive at the condition that average cost equals price. A typical 
form of .the argument is as follows: (1) a firm will equate marginal 
cost to price; (2) it will also try to minimize its unit cost; (3) at the 
point of minimum unit cost average cost equals marginal cost; 
(4) hence, average cost must equal price (average revenue) and 
profits will be zero. 

Stated explicitly, it is obvious that the second statement is false. 
The play on words arises from the confusion of the condition that 
for each and every output total and unit costs must be a minimum 
with the statement that of all outputs possible that one is chosen 
at which unit cost is the lowest. The first implies the valid condi- 
tion that the marginal productivity of the last dollar in every use 
must be equal. The second implies the invalid (from internal con- 
siderations) condition that output be determined irrespective of the 
sales price. 

On the other hand, some have tried to argue as follows: (1) the 
production function from the nature of things must be homogene- 
ous of the first order; (2) by Euler’s theorem it follows that]' if 
factors are paid “according to the marginal productivity principle,” 
product will be exhausted. ,, 

As an example of the lack of integration between the theory of 
production and that of cost, we find many writers asking whether 
product will be exhausted at the same time that they have already 
agreed that price equals average cost, and total revenue equals 
total cost. Of course, the latter condition is merely another way 
of stating the former. 

Once the problem is properly stated as that of determining the 
relation between gross revenue per unit and expenditure per unit, 
it should be reasonably clear that this cannot be determined by 
the properties of the production function alone, but must depend 
upon the marketing situation of the firm, which in turn depends 
upon the competition of other firms. It is quite clear that as far 
as the single firm is concerned it is possible that it be making huge 
profits regardless of the homogeneity of the production function. 
This condition is neither necessary nor sufficient to the exhaustion 
of the product. If the production function were homogeneous, but 
demand were sufficiently favorable, of course product woi 
be exhausted — even under pure competition. 
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The problem of homogeneity of the production function is one 
about which much controversy has raged. It has long been held 
on philosophical grounds that product must be a homogeneous 
function of the first order of all the variables, and that if this is not 
so, it must be either because of "indivisibility” or because not all 
"factors" have been taken into account. With regard to the first 
point, it is clear that labeling the absence of homogeneity as due 
to indivisibility changes nothing and merely affirms by the implica- 
tion that "indivisibility" does exist, the absence of homogeneity. 

With respect to the second point, we may reverse the Aristo- 
telian dictum and affirm that anything which must be true self- 
evidently (“philosophically”), intuitively— i.e., by conventional 
definition of the terms involved— that such a principle can have 
no empirical content. It is a scientifically meaningless assertion 
that doubling all factors must double product. This is so not 
because we do not have the power to perform such an experiment; 
such an objection is of course irrelevant. Rather the statement 
is meaningless because it could never be refuted, in the sense that 
no hypothetically conceivable experiment could ever controvert 
the principle enunciated. This is so because if product did not 
double, one could always conclude that some factor was "scarce ” 19 

It is useful, I believe, to avoid the expression “factor of pro- 
duction" entirely. This has been used in at least two senses, 
neither of which is quite satisfactory. First, it has been used to 
denote broad composite quantities such as “labor, land, and capi- 
tal.” On the other hand, it has been used to denote any aspect 
of the environment which has any influence on production. I sug- 
gest that only “inputs" be explicitly included in the production 
function, and that this term be confined to denote measurable 
quantitative economic goods or services. The production function 
must be associated with a particular institution (accounting, 
d ecis'i on - m ak*i ng unit, etc.';, an6Tmr^t’Defirawr£ipm>t/i’2frfy 'tniRfift. 
circumstances pertaining to this unit. Other definitions are of 

»* Any function whatsoever in n variables may be regarded as a subset of a larger 
function in more than n variables which is homogeneous of the first order It is because 
this is true of any arbitrary function that this generalization is useless For example, 
the volume of a sphere not being a homogeneous function of the radius, a new factor can 
be defined whose "scarcity” will explain this fact Like a false proposition in logic from 
which every proposition can be derived, this overgenerahty renders such a convention 
useless. 



THEORY OF COST AND PRODUCTION 


85 

course possible, but it is clear that our previous conditions cannot 
be expressed in terms of them. 

So defined, the production function need not be homogeneous 
of the first order. If really homogeneous, marginal costs would 
always be constant. 14 It is indicative of the lack of integration 
mentioned above that many writers assume U-shaped cost curves 
in the same breath with homogeneity of the production function. 

In reality, it is not on philosophical grounds that economists 
have wished to assume homogeneity, but rather because they were 
afraid that, if they did not do so, contradictions would emerge to 
vitiate the marginal productivity theory. This is simply a mis- 
conception as will be indicated below. 

Our discussion can be confined to the relation of total cost and 
total revenue. The implications for marginal productivities can 
then be indicated. 

It is clear that the firm does not of its own volition act so as to 
equate average gross revenue to average expenditure, although it 
may in the long-run prevent average revenue from being less than 
average cost by going out of business. 

It is only through the competition of new firms that the demand 
curve of the firm may so shift downward as to make the position of 
maximum profit one at which total gross revenue equals total 
expenditure. 

Reserving for later investigation the conditions under which 
the demand curve will so shift, let us investigate the implications 
of the assumption of zero net revenue. Given 


and 


dR _ dC 
dx ~ dx 


( 111 ) 


7T = R - C = 0, 


( 112 ) 


it follows that the demand curve must be tangential to the total 
unit cost curve. 

In the case of pure competition the demand curve is a hori- 
zontal line, and under the conventional assumptions as to the shape 
of the cost curve, the tangency will be at a unique point, that of 

14 It follows from Euler's generalized theorem on homogeneous functions that the 
Hessian of a homogeneous function of the first order is identically zero. vious y, 
stable equilibrium for a firm under pure competition is impossible in these circumstances. 
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minimum average cost. This follows from the fact that average 
cost must be equal to marginal cost, and that the latter is rising. 
dR ^ 8C C 

dx~^^dx = x = minimum average cost. 18 (113) 

Of course, 

px = E Wt by assumption, ( 114 ) 

and 

dC 

Wi ~dx v *~ (115) 

Hence, 

x = £ <PiV>. (116) 

This looks superficially like Euler’s theorem, but it is not. For 
Euler’s theorem is an identity and should be written 


* - S (117) 

\ 

whereas this is merely a condition of equilibrium holding at a 
single output. 

Moreover, in the case that the demand curve may be negatively 
inclined, we get the more general formulation 

OR 

x = ~iv.v. (118) 

P 1 

This formulation differs from that of Walras, Wicksell, and 
Hicks in that the condition of minimum unit cost is derived as a 
theorem from the condition that total revenue equal total expendi- 
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ture. It is this last condition and the forces which lead to it that 
are of importance, and not the question of homogeneity at all. 

It is quite clear that in the real world net revenue is not zero 
for all firms, nor is it tending towards zero. This is true under 
pure competition as well as impure competition. It is clear that 
this residuum must be “due” to something, and it may be labeled 
by any name we please (rent to institutional advantage, etc.). 

The existence of this residuum does not imply any indeter- 
minacy whatsoever. Optimal output, revenue, expenditure, and 
the difference between these two terms are all fully determined. 
Of course, under ideal conditions this residuum will be capitalized 
by going concern valuations. Economists, remembering the class- 
ical solution of the problem of distribution in which the shares of 
two or more factors were simultaneously residually determined, 
have swung too far in the opposite direction. The attempt to 
“explain” all residua in terms of marginal productivity analysis 
applied to a wider production function can always be done by 
convention, but is devoid of empirical content. 

It is convenient to have an analytic definition for the case in 
which “competitive” conditions between firms are such that the 
demand curve of any firm will always shift until net revenue is 
equal to zero. The term “free entry” may be defined as the con- 
dition under which this holds. Of course, this classification cuts 
across that of pure or impure competition. Thus defined, free 
entry is a condition to be looked for empirically, rather than one 
imposed upon the data a priori. 

I suspect that part of the economists’ intuitive desire to define 
a category of “profits” as distinct from “rent to institutional ad- 
vantage” stems from a subconscious remembrance of the old- 
fashioned distinction between “natural” and “contrived” scarcity. 
Perhaps too little is heard today of this distinction, which has 
important connotations for social policy and welfare economics. 

Summary 

In conclusion, a summary formulation of the analysis is pre- 
sented to replace the famous three Theorems of Walras. These, 
aside from being redundant and ambiguous, are not each of the 
same order of meaning. The formulation here is for the continuous 
case and in terms of marginal equivalences, but the more general 
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formulation in terms of marginal inequalities readily suggests itself 
1 hroughout a firm with given production function, factor prices 
and demand conditions is posited The general case of pure or 
impure competition is considered 

I The first fundamental assumption is that the firm tries to 
maximize its profits and from this the following internal conditions 
of equilibrium can be deduced 

A Any output which is produced must be produced with factor 
combinations such that total cost is a minimum As a result of 
this we have two corollaries 

1 The marginal productivity of the last dollar must be equal 
in every use 

2 The price of each iactor of production must be proportional 
to marginal physical productivity, the factor of proportionality 
being marginal cost 

B That output will be selected which maximizes net revenue 
total cost being optimally determined by the previous conditions 
This implies 

1 The equality of marginal cost and marginal revenue the 
slope of the latter being the smaller 

2 In combination with the previous conditions under A wc 
also have the marginal value productivity of each factor equal to 
its price the first term being defined as marginal revenue times 
marginal physical productivity 

3 Total cost must not exceed total revenue, since otherwise 
the firm would go out of business 16 

II If we impose by arbitrary assumption or hypothesis the 
external conditions that entry be free l e , that total revenue be 
equal to total cost, then 

A Product will be exhausted by definition 

B The demand curve must be tangent to the unit cost curve 
In the case of pure competition this implies minimum average cost 

i» if the enterprise under consideration owns productive resources which have a sale 
value on the market it is necessary that net revenue be at least as large as the sale 
(liquidation) value of these resources As an internal condition of equilibrium r S sale 
value of owned resources Alternative uses elsewhere introduce opportunity costs 
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Aside from the above general conditions of equilibrium, it has 
been shown how the definition of an extremum position may be 
utilized (a) to evaluate the direction of change of variables with 
respect to parameters (prices) taken as data, regardless of condi- 
tions of continuity, and (b) to develop reciprocal relations imposed 
upon demand derivatives, where these exist. 



CHAPTER V 

THE PURE THEORY OF CONSUMER S BEHAVIOR 

If ONE were looking for a single criterion by which to distinguish 
modern economic theory from its classical precursors he would 
probably decide that this is to be found in the introduction of the 
so called subjective theory of value into economic theory This 
revolution m thought broke out almost simultaneously along three 
fronts and with it we associate the names of Jevons Menger 
and Walras 

Moreover, it is this part of economic doctrine which has proved 
to be the center of so much controversy Indeed many critics of 
the orthodox tradition have identified the whole body of economic 
theory with the belief in that abstraction homo economicus In 
fact, many economists well within the academic fold would sepa 
rate economics from sociology upon the basis of rational or irra 
tional behavior where these terms are defined in the penumbra of 
utility theory It would seem extremely important therefore to 
know clearly what is contained in the conventional utility analysis 
if only to understand the consequences of denying its validity 

Evolution of Utility Concept 

The concept of utility may be said to have been undergoing 
throughout its entire history a purging out of objectionable and 
sometimes unnecessary, connotations The result has been a much 
less objectionable doctrine but also a less interesting one With 
out doing justice to the subject these developments may be sum 
marized in a brief way It must be clearly understood however 
that these are the movements of the pioneers of thought Their 
work appears chiefly in academic journals and has little influenced 
the general class of economists 

(a) One clearly delineated drift m the literature has been a 
steady tendency towards the rejection of utilitarian ethical and 
welfare connotations of the Bentham Sidgwick Edgeworth va 
nety These matters still receive consideration in questions of 
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normative policy, but they are clearly separated from the problem 
of consumer’s behavior. Although especially marked in regard to 
inter-individual welfare comparisons, there is the same tendency 
in connection with the analysis of the behavior of a single indi- 
vidual. Only as obiter dicta do we find in the modem literature 
discussions of particular pleasures as being pure or impure, etc . 1 

(b) Concomitantly, there has been a shift in emphasis away 
from the physiological and psychological hedonistic, introspective 
aspects of utility. Originally great importance was attached to 
the ability of goods to fill basic biological needs; but in almost 
every case this view has undergone extreme modification. At the 
same time, there has been a similar movement away from the con- 
cept of utility as a sensation, as an introspective magnitude. It is 
not merely that the modern economist replaces experienced sensa- 
tion or satisfaction with anticipated sensation, desire, according to 
the now familiar distinction between ex post and ex ante analysis. 
But much more than this, many writers have ceased to believe in 
the existence of any introspective magnitude or quantity of a 
cardinal, numerical kind. With this skepticism has come the recog- 
nition that a cardinal measure of utility is in any case unnecessary; 
that only an ordinal preference, involving “more” or “less” but not 
“how much,” is required for the analysis of consumer’s behavior. 

Indeed, so far has the reaction gone that it is the belief of many 
that nothing remains but an empty convention. Others, who do 
not admit the hollowness of utility, have in some cases embraced 
a formulation of the analysis which is meaningless in any opera- 
tional, empirical sense . 2 The result is a curious jargon of dogmatic 
precepts. 

Thus, the consumer’s market behavior is explained in terms of 
preferences, which are in turn defined only by behavior. The re- 
sult can very easily be circular, and in many formulations un- 
doubtedly is. Often nothing more is stated than the conclusion 
that people behave as they behave, a theorem which has no em- 

1 The Cambridge tradition is perhaps an exception in this respect, although even 
here the change in emphasis is notable. 

2 Cf. Alan R. Sweezy, "The Interpretation of Subjective Value Theory in the 
Writings of the Austrian Economists,” Review of Economic Studies, vol. I, no. 3 (1934), 
pp. 176-185. 
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pineal implications, since it contains no hypothesis and is consistent 
with all conceivable behavior, while refutable by none . 3 

Nevertheless, as we shall see, the modern utility theory with all 
its qualifications is not in a technical sense meaningless It is a 
hypothesis which places definite restrictions upon demand func- 
tions and price-quantity data, these could be refuted or verified 
under ideal observational conditions One should have thought 
that these empirical implications would have been the sole end of 
the theorists who have concerned themselves with these matters. 
Strangely enough, means and ends have been so confused that only 
a small fraction of the literature has been concerned with this 
problem even indirectly, moreover, in this there are scarcely half 
a dozen papers in which valid demand restrictions have been 
developed. 

I do not propose to defend the fruitfulness of these empirical 
restrictions The extent to which they satisfy and unify the fac- 
tual behavior of consumers cannot be settled by argumentation 
However, for better or worse the theory of utility has occupied an 
important position in economic thought for the last half century 
This alone makes it desirable that its meaning be clearly understood 

Progression in Mathematical Thought 

From the beginning mathematical methods have figured promi 
nently in the analysis of utility. Despite the unfavorable reaction 
produced among some writers who felt that a spurious precision 
was implied by the use of these supposedly “exact” tools, it is 
demonstrable from the literature that symbolic methods have been 
an aid to clear thinking and the advancement of the analysis 4 
For those who used this abstract language were forced to formulate 
their concepts unambiguously, and so the way was opened for 
and* o/iaUfina/Jon.. 

It is interesting, therefore, to review very briefly the history of 
some mathematical aspects of the theory to bring out clearly the 
progression in thought through time. 

» Still another “meaningless ' theory is held by those writers who speak of behavior 
in terms of the economic principle, regardless of whether any empirical behavior related 
to it exists 

* Thus Edgeworth's Mathematical Psychics offers penetrating insight into the views 
commonly held in his day 
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As early as 1854, Gossen is credited with presenting what is 
essentially marginal utility. He assumed this to be a decreasing 
linear function of the quantity of any particular good. The utility 
function would therefore be as follows: 

U = K + {aix 1 — bvc^) -f (a 2 x 2 — b 2 xP) + ■ ■ • . (1) 

Jevons, writing fifteen years later, proposed that the utility 
function be written as the sum of utilities pertaining to each good 
separately. 

U = Vi(xi ) + V 2 (xz) + - +• V n (x n ), (2) 

where the functions V { obey the law of diminishing marginal 
utility. 5 Specifically, 

V/fri) > 0 , (. 3 ) 

Vi"(xi) < 0 . 

In his Mathematical Psychics (1881), Edgeworth, going further 
than Jevons, suggested that the requirement that utility be a sum 
of functions pertaining to each good was an unnecessary and indeed 
unjustifiable assumption. He proposed, therefore, that the utility 
function be written in the form 0 

U = <?(x u ■ ■ ■, Xn), (4) 

where <p is any joint function of the quantities of all goods, and 
where 

w-aSr,* 0 - (S) 

By the end of the nineteenth century many writers, notably 
Pareto, had come to the realization that it was an unnecessary and 
unwarranted assumption that there even exist utility as a cardinal 
magnitude. Since only more or less comparisons are needed for 
consumer’s behavior and not comparisons of how much more or 
less, it is only necessary that there exist an ordinal preference field. 

6 Walras and Marshall also made the assumption that utility may be written as 
in ( 2 ). In the case of Marshall, as will be discussed later, it is not clear whether he 
really intended to be taken literally when making the assumption that utilities are inde- 
pendent, or whether he regarded this as an approximation for small movements under 
certain conditions. 

6 Professor Irving Fisher is also credited with the independent discovery c 
possibility at a later date. 
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For any two combinations of goods respectively (*i°, , Xn o) an( j 

C*! 1 . • or for brevity, (X°) and (X 1 ), it is only necessary 

that the consumer be able to place them in one of the following 
mutually exclusive categories 

a (X°) preferred to (X 1 ) 

b (X 1 ) preferred to (X°) (6) 

c (X°) and (X 1 ) equally preferred or indifferent 

For convenience, we may attach a number to each combination 
this is assumed to be a continuous differentiable function This 
function (or rule of numbering) may be written 

V = q»(X) = <p(pc u » *») (7) 

It is so constructed that the following three conditions correspond 
to the above three respectively 

a' *(X>) < <p(X°) 

b' *(X«) < ^(X 1 ) (8) 

c' HX*) = v>(X l ) 

<p may be designated as a utility index The one parameter 
family of loci defined by 

*-(*i , *») = c, 

where C is regarded as a parameter, are designated as indifference 
loci 

It is clear that any function 

U = F(<p), F'{<p) > 0 (9) 

defined by any monotonic transformation of <p, is also a utility 
index For 

^(X 1 ) $ <p{X a ) implies U{X l ) t U(X°), respectively 

The converse also holds Thus from any one utility index ah 
others can be derived by a suitable functional transformation 
To summarize, our ordinal preference field may be written 

v = fj>(x„ ,*„)] F’M > o (10) 

where <p is any one cardinal index of utility 

It is clear that the choice of any one numbering system or 
utility index is arbitrary The indifference loci are left unchange 
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by any alteration of the tags attached to each, provided ordinal 
relationships are maintained. In order, therefore, to avoid the 
asymmetry of employing any one favored utility index, many 
writers (Pareto, W. E. Johnson, Hicks and Allen, et al.) have sug- 
gested that a notation be employed which is dependent only upon 
the invariant elements of the ordinal preference field, namely, the 
indifference loci. 

The direction cosines of the tangent plane to an indifference 
locus at any point must be in determined ratios. Given any one 
utility index, we have 



We may take as given the invariant slope functions 

-(ilL„ -■*<*•• •••<*■>• = •••■»> < 12 > 

These are invariant under any change of utility index, for 

2 pi U i{ X 1, * * * ) X n ) 

* ~ Ui(x u ■■■,*») 


However, if we consider more than two commodities, the func- 
tions 1 R i cannot be all arbitrarily chosen. In order that there 
exist an ordinal preference field of the type described above they 
must, as Professor Fisher has pointed out, satisfy the following 
integrability conditions: 

*22/ - W hRi 1 ' = l Rj - IP 1 ' iRih (14) 

so that the following so-called Pfaffian 

dx 1 + 1 RMx 2 + - • • + Wdxn (15) 

admits of an integrating factor y(xi, • • • , x„) , and may be con- 
verted into the exact differential 

d<p = yd%i + ( y 1 R 2 )dxz + • • • + ( 7 1 R n )dx n 

= tpidxi -}- tpidxi + — + tpndx n , 


F (pi{X\, ' , x n ) 


F'<pi{x u 
<Pi(x 1 , • 


' > Xji) 
' ) Xri) 


<Pl(Xl 


' , Xn) 


(i = 2, •••,») (13) 


( 16 ) 
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= 7 , 

and 

ifi, = (y'R') 7 (17) 

from this stage it was but one small step to the rejection of 
the integrabiUty conditions Thus, Pareto, Hicks and Allen and 
others have been content to start out with the assumption of a 
planar element embodying indifference directions at each point 
The latter two writers call these the respective marginal rates of 
substitution between the ith and first goods These are written 
as m (12), but the functions are not required to satisfy the partial 
differential equations of integrability presented in (14) 

l R* = **»(*, , x n ) (t = 2 n) (18) 

The Demand Functions as a Govl 
We have seen an account of the transformations which the 
preference field has undergone through time However, nothing 
has been said as jet as to the use to which these concepts are put 
m the explanation of consumer's behavior This w e must now do 
in order to investigate the meaning — in the technical operational 
sense — of the various hypotheses 

Following traditional assumptions of the pure theory of con 
sumer’s behavior, we consider a single idealized consumer buying 
goods and services per unit time m a market w hose prices he cannot 
appreciably affect The selling of personal goods and serv ices may 
be at times regarded as negative purchases For present purposes 
each good and service is taken as clearly defined, homogeneous 
divisible, etc Let us designate all goods and services (?i , x n ) 

with respective given prices ( pi p„) Total expenditure or 

income is defined as 

/ = Tipi + Xlpj + + x n pn = Z P& (19) 

As of any giv en total expenditure and a given set of prices, it is 
assumed that our idealized individual will select some determined 
amounts of each and ev ery good (Of course, the amount of some 
goods may be zero ) That is to say, the quantity of each good is 

1 0f course F{v)f i « also an integrating factor 
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a function of all prices and income. 

tfi = h l (pi, I) 

x 2 = h 2 (p u ■••,p n ,r) 


X n — Jl n (p\, 


‘ > Pni !)• 


( 20 ) 


These are the general demand functions. Their derivation is the 
whole end and purpose of our analysis of consumer’s behavior. As 
has been reiterated again and again, the utility analysis is meaning- 
ful only to the extent that it places hypothetical restrictions upon 
these demand functions. This is the point of view from which we 
shall proceed. 

The Marshallian partial equilibrium demand functions for the 
first good would be, of course, 

*i = h l (Pu p2, ■ p n , I) - D'ipi), (21) 

where all other prices and income are held constant by ceteris 
paribus assumptions. A meaningful restriction upon our price- 
quantity data would be the hypothesis that an increase in one 
good’s price will, ceteris paribus, result in a decrease in its quan- 
tity; i.e., 

= hi* < 0 . ( 22 ) 

dpi 

Is this derivable from the utility analysis? Can anything be said 
about dxi/dpj, the change in the quantity of one good when some 
other price varies? What about dXi/dl, the rate of change in the 
quantity of the Ah good with respect to a change in income? 
These are the questions whose answers we must seek. 


Conditions of Equilibrium 

It is not necessary that the demand curves be derived for each 
of the preference fields defined by (1), (2), (4), and (10), respec- 
tively. Fortunately, the last includes all previous formulations 
as special cases. I start out from the general case of an ordinal 
preference field, later considering the meanings of the special cases. 

The utility analysis rests on the fundamental assumption that the 
individual confronted with given prices and confined to a given total 
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expenditure selects that combination of goods which is highest on his 
preference scale This does not require (a) that the individual 
behave rationally in any other sense, (b) that he be deliberate and 
self-conscious in his purchasing, (c) that there exist any intensive 
magnitude which he feels or consults 

Our problem, therefore, is the comparatively simple one of 
finding a maximum for 

U “ *t>(*i, .*»)] (10) 

subject to 


£A*-/, 


(19) 


where {pi, , p n , I) are each preassigned parameters 

This is a constrained maximum problem, since equation (19), 
familiarly termed the budget equation, must be satisfied This 
restricts the choice of quantities Without such a restriction the 
individual could presumably purchase an unlimited amount of 
goods up to the point of satiation But m point of fact, goods are 
not all free, with a fixed income the more of one good which is 
bought, the less must be consumed of another 

In the Appendix it is shown that we must have as a necessary 
condition for such a constrained relative maximum 

U, + \p i = Q, (**1, ,n) (23) 

where X is a so called Lagrangean undetermined multiplier This 
may also be rewritten in either of the two following equivalent 
forms 

T Hi- (24) 


(25) 


= Mz = -x 

Pi P2 Pn 

This means that in equilibrium the ratio of the marginal utilities 
of two goods is equal to the ratio of their prices, i e , marginal 
utilities are proportional to prices 

It is clear from the formulation of (24) that it does not matter 
which utility index we use, for 
Ui = FV. 


(* = !.' » «) 


( 26 ) 
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(27) 


Our conditions of equilibrium yield the same solution , therefore, re- 
gardless of our choice of a particular utility index. It is as meaning- 
less to argue that one particular utility index is really the true 
measure of utility as it is to argue that the earth really revolves 
about the sun and not vice versa. Only in terms of observations 
other than those envisaged in our market place can a cardinal 
utility magnitude be defined. 

The formulation of (25) gives rise to the familiar interpretation 
that in a maximum position the marginal utilities of the last dollars 
spent for each and all commodities must be identical. This mag- 
nitude ( = — X) has been termed the marginal utility of money, 
or better still, the marginal utility of income. It will be noted that 
it is not invariant under a change of the utility index, and so for 
an ordinal field no significance attaches to its magnitude, nor to 
the rates of changes of its magnitude with respect to any variables. 

Employing the notation of indifference loci, the same conditions 
may be derived. For from (13) 


* , %n) 

Ui(x 1, • • •, X n ) ' 


(f = 2, •••,«) 


Therefore, the conditions of equilibrium of (24) may be written 


l R*(xu ■ ■ • , x n ) - £ = 0. (i = 2, • • • , n) (28) 

This is the familiar tangency of the budget plane with the indiffer- 
ence locus passing through the point of equilibrium. Figuratively, 
the consumer moves along the budget plane until he attains the 
position lying highest on his preference scale, which must in the 
continuous case be a position of tangency; for if the budget plane 
crossed the indifference locus, he could advance to a still higher 
position. 

We have stated our equations of equilibrium in several different, 
but mathematically equivalent ways. The formulation of (23) is 
one in which the symmetry of all variables is maintained, and so 
for definiteness we may concentrate upon it. Our budget 
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of (19) must also be satisfied, and so our full conditions of equi- 
librium can be written 

Ui(x u , x«) + \p t = 0, ( 1 = 1 ,..., n ) 

piXi + ptXz 4 - • + p n x n —1=0. ( 29 ) 

These conditions of equilibrium correspond to the set (1) of 
the second chapter. We wish to derive from them our demand 
functions, 

*» - WPl, ' , Pnt I), (30) 

which correspond to the equations (2) of the second chapter 
Prices and income arc regarded as data for this analysis, and we 
should like to know how our equilibrium quantities vary with 
changes in these parameters 

Our conditions of equilibrium are {n 4- 1) in number, and in- 
volve 2 (» -f !) unknowns, namely (— X, x u , v n , p lt • •, p„, J) 
Avoiding now all problems of multiplicities of solution, we may 
assume that (n + 1) of our variables may be solved for in terms 
of the remaining (« 4- 1). In particular (— X, x it • , x n ) may be 
each solved for in terms of (p u , p n , I). Hence, we get the 
following functions- 

X, = h’(Pu (t => 1, , n) (30) 

and 

(-X) =/(/>., . pn, 1) (3D 

Thus, our demand functions can be derived from our conditions 
of equilibrium. We have also introduced a new variable (— X), 
the marginal utility of income, which could, of course, have been 
eliminated, but only with a loss of symmetry. Had we employed 
one of the other equivalent sets, such as (28), we should have had 
n equations between (2 n -f- 1) variables, and so our ft quantities 
could have been evpressed as before in terms of the (» + 1) prices 
and income parameters 

Displacement or Equilibrium 

We have counted our equations and unknowns, and found them 
to be equal in number. Subject to certain restrictions, this assures 
ns that all our equilibrium variables are determined. There is a 
temptation to stop at this point and rest content with these 
achievements. 
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In view of all that has been said in earlier chapters, it requires 
no further argument to show that our task has hardly begun. 
There remains the sizable problem of deducing the qualitative 
properties of our demand functions from our knowledge of the 
properties of our equilibrium maximizing equations. 

To do so, we employ the same methods outlined in the second 
and third chapters. Let us write the total differential of the equi- 
librium equations (29) 

Ui\dx\ -J- U +-••-}- U in dx n -f- pid\ 

= ( — X)dp„ (i — 1 , • ••,«) 

pidxi + p'TLx'i -j- • * • -f- p n dx n 

= dl — (x\dp\ -f- x%dpi + • • • -j- x n dp n ), 
or 

23 Utjdxj -f- pid\ — (— \)dp„ (i = 1, ■ ■ ■ , n) 

1 (32) 

n n 

23 pjdxj = dl — 23 x k dpk . 

i i 

These are (n -f- 1) linear equations in the in ■+ 1) unknowns 
[dx\, - • •, dx n , d(— X)J. Their solution may be indicated as 
follows: 


dxj — 

d{~\) = 

where 


23 (~ 'h)D 1J dp i +{dT - 23 x h dp>)D n +uj 
1 1 

D 

— L 22 (~ T)D i in+ idpidr {dl — 23 Xf:dpk)D n+ i <n+ i] 

i i 

D 


(33) 


D 


U, y| 

Pi 

Pi 

0 


U n 

U u ■■■ 

U In 

Pi 

U, I 

Uk ■■■ 

Ul n 

Pz 

Uni 

u n 2 ••• 

U nn 

Pn 

Pi 

P 2 

Pn 

0 


(34) 


and Dij indicates the cofactor of the element of the 7th row and the 
/th column. 
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The formulae in (33) give the changes in our unknowns for any 
changes in the parameters, prices, and income. As special cases 
the following partial derivatives may be evaluated: 


dx, (- \)D, } - x,D„^ , . 

, (>,j = i , ■■•,») 

dx, __ Dn+lJ 

dl ~ D * 
where, of course, 


Also, 


d*> __ hJ dx, __ , 

dpi h " dl ~ hl 


d(~ X) _ - [(- \)Di. nj-1 - 

dp . “ D 

3(— X) — Pn+-).»-H 

5/ D 


(35) 


(36) 


It is convenient to consider a compound term introduced first 


by Slutsky, defined as follows: 




K - ^ 4- v dxj 
K,t dp, + x * di • 

{ij = 

l, ••*.») 

(37) 

From (35) by substitution 




II 

*<F 

(i,j = 1. 

, • • • , n) 

(38) 

Also define 





1 1 

k— xj • 

(i = I, 

■,*) (39) 

or 




r - 
r, D 

t* - 1. • 


m 


The expressions dx,/dp„ dx,/dl, K ;i are all properties of the 
demand functions and are empirically determinable under idea! 
conditions. We seek restrictions on them. 

Inspection of the determinant D reveals that it is symmetrical 
with respect to i and j, since 
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Hence, 


p- _ (~ X)Ay (- \)D Si 
H D ~ D 


Kir, 


dXj dx± _ dXi dXi 

dpi + ** dl ~ dpi + Xi ~dJ- 


(i,j = 1, •••,») 


(41) 

( 42 ) 


What is the economic interpretation of the compound term 


js- _ & x i _j_ dXj 

Kii dpi + *' dl ? 


It has been called by Slutsky the residual variability of the jth good 
for a compensated change in the ith price.* 

This can be made more clear by the following considerations. 
Thus far, we have imagined the individual to be maximizing his 
utility as of given prices and total expenditure. A little thought 
will reveal that utility will be maximized as of a given expenditure 
only if the level of utility which is being realized is being achieved 
in the cheapest possible way; i.e., expenditure must be minimized 
as of any level of utility. If this were not so, the same level could 
be achieved with some money left over; this remnant could be 
spent to buy more goods, and hence a still higher level of utility 
could be attained. 

Along any indifference .locus there exists for any set of prices 
an optimum set of purchases which minimizes total expenditure. 
That is, 

Xj = p j \jpi, ••■,£„, T(<p)]. (j - • • • > n ) (43) 

For 

U = F(<p) = constant, 

we are confined to the same level of utility. It could be easily 
shown 9 that 


K “ = CiX. 


constant 


= 'Pd- (i,j ~ •••>”) (44) 


In words, Ka is equal to the change in the quantity of the jth 
good with respect to the ith price, where the individual moves 

8 E. Slutsky, "Sulla teoria del bilancio del consumatore,” Giornale degli economisli, 
LI (1915), 19-23. 

? See chap, iv, pp. 62-65. 
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along the same indifference focus and keeps his expenditure down 
to a minimum before and after the change m price 10 

From the formulation of our equilibrium equations m (29) we 
have seen that the properties of the demand functions are not 
affected by our choice of utility index This may be shown ex 
phcitly from the identities 

U = F( v ) (45) 

Ui = F’ipi (46) 

U„ ~ F'tp t) -f F"<p,vj (47) 

(_ X ) - F'(- X'), (48) 

where ( — X) is the marginal utility of income for the utility 
index <p Let 



From (47) 

n I Ihi] P> 1 1 F'v,j + F 'w, | PA I F'vi, | pA 
I Pi 0 I | Pi 0 | - 1 Pj 0 | 

s {FT'D’ (SO) 


Similarly the following relationship holds for all cofactors 

D u - = ,n) (51) 

Hence 


(- \)Dp _ (- X)D’„ 
D ~ D 


(52) 


so that is an invariant under any transformation of utility 
index Literally nothing ts implied jor empirical price behavior by 
the choice of any particular utility index 

Inspection of our equilibrium conditions in the form of (24) 
and (19) reveals that they are unaffected by a proportional change 
m all prices and income our equilibrium values remain intact for 
such n change i e , 

Xt-FiPi p n I)=h‘(m P i mpn. ml) (i = l» .”) (55) 
where m is any positive number Mathematically our demand 

•i For another interpretat on see H Schultz The Theory and Measurement of 
Demand (Ch cago Untvers ty of Chicago Press 1938) pp 43-45 
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functions must be homogeneous oj order zero. Employing Euler’s 
theorem for homogeneous functions, we have 11 


dxi dxi 

dPi pl + W * p2 + 


, dXj t , dXi 

+ dp n Pn + 5 1 1 ~ °- 


(i = !,•••,«) (54) 


Dividing through by re,- yields the following relationships in terms 
of elasticity coefficients: 


V il + 7?t2 + ’ • • + Vi* + vu = 0, 


where 


Va = 


dXj pj 
dpj Xi 


(i = 1, •••,«) (55) 


is the elasticity of the 4 th good with respect to the price of the 
jth good, and 

dxi I 


Vu = 


dl Xi 


is the income elasticity of demand of the 4th good. Intuitively, 
we should expect this equality, since an upward movement in all 
prices is equivalent to a decrease in money income. 


Thus far our analysis has not been completely devoid of mean- 
ing. The following empirical restrictions have been found to hold 
for the demand functions. 

I. They are homogeneous of order zero; i.e., a simultaneous 
doubling of all prices and income leaves all quantities demanded 
invariant. This implies as we have seen 


dxi 

dpi 


P 1 + 


dXj 

dp 2 




dXi 

JT 




(*=!,•••,») (56) 


or in elasticity terms 

Vil + Vil + • ■ • + Vin = — Vu', = ' ' '» n ) 

i.e., the sum of the elasticities of a good with respect to each and 
every price is equal in absolute value, but opposite in sign, to the 


11 This may also be proved by substitution from (35). 


r— » dX\ d X ( « » / dx% 


dXi' 

\ dp; ’ dl . 


, tip; dl , \op; ai / 1 

by a well-known theorem on determinants that the expansion of the elements of c 
column with respect to the cofactors of a different column must vanish. 
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income elasticity of demand for that good These are n restnc 
tions which are not compatible with any and all price quantity 
behavior, and so are meaningful ,J 

Because of this homogeneity condition it is not necessary to 
take as independent variables the n prices and income These 
(» -f* 1) variables may be reduced to n variables by considering the 
ratios between any n and the remaining variable 

Thus, iv e may divide through by any price, say the price o( 
the first good, to get 


x, =* A'(pi, , p„ 


1) = h '( i ’pr -pA) 

(Pi Pi Pn £\ 

\pi' Pi' 'pi'piJ 


(*«1, ,n) (58) 


This is equivalent to setting the price of the first good equal to 
unity, and using it as our numeraire 

However, a more symmetrical measure suggests itself Divicf 
ing through by I, wc get 

*<-*■(*. .P«D >/- 0 

= ,tf) (t-l, ,») (59) 

n From our def mtion of income or total expenditure as 

i - i. #>i*t 

we have the following (« + 1 ) restrictions on elasticities of demand 
*1 t m 1 

and 


& 1 i — - */ 



is the proportion of income spent on the *th good However these are not meaning 
restrictions since they are consequences of our definition At best they cou 
reveal that we have not applied our defined operations with numerical accuracy 
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Hence, 



(J = !»•••,») 


, a n ). 


(60) 


The as here are very natural units to employ, since they involve 
only the dimensions of the respective quantities. In words, a,- may 
be defined as the proportion of total income required to purchase 
a single unit of the jth good. 

II. We have also the following reciprocal “integrability” con- 
ditions. 


jr OXj . OXj OXj OXj jy, /*• 1 \ /jC 4 \ 

Kji — Qp. + X{ — dpj "4* qj (i, j — 1, • • m 7 n) (61) 

i.e., the residual variability of the jth good for a compensated 
change in the ith price is identically equal to the corresponding 
term for the ith. good with respect to the jth price. These are 
n(n — l)/2 independent meaningful conditions. 13 


Meaningful Theorems 

Thus far almost nothing has been said about the directions of 
change in our equilibrium quantities of goods demanded with re- 
spect to changes in prices and income. Does the utility analysis 
have nothing to say upon this question? The answer can be sought 
along lines indicated in previous chapters. 

Before clouding the air with determinants, let us make a 
common sense appraisal of the situation to see whether we cannot 
suggest a simple answer. 

First, suppose the individual to be constrained to move along 
the same indifference locus. Let him be confronted with a set of 
prices, and attempt to attain this level of “utility” in the cheapest 

13 They hold, of course, only for the individual demand functions. Moreover, they 
reflect differential properties of our demand functions which are hard to visualize and 
hard to refute. For our empirical data consists of isolated points. These must be 
smoothed in some sense before our relations can be tested ; the smoothing, even by the 
best known statistical methods, is to a degree arbitrary, and so refutation and verification 
are difficult. 

I have tried, but thus far with no success, to deduce implications of our integrability 
conditions which can be expressed in finite form, i.e., be conceivably refutable mere > 
by a finite number of point observations. 
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possible way Consider the set of prices (pi 0 , , p n °) There 

will correspond to this an optimal set of quantities (xj 0 , , x n °) 

such that total expenditure is as low as possible 1 e , 


£ p.°X t == £ 
l i 

(62) 

where (J\f) is any other point on the locus 


Kvm = k<p(x°u 

(63) 

Consider now a second set of prices, (pi 1 , 
corresponding optimal set of goods, (xt 1 , x n 

same indifference locus as the first Then 

, pn), and the 
‘), lying on the 

£ p t l x> = £ P.V 

i i 

(64) 

In equations (62) and (64) any values of x (along the same locus) 
may be inserted in the respective left-hand sides In particular, 
we may write them respectively 

£ pm b £ p,w, 

i i 

(65) 

and 


£p.V ^ LPiW 

(66) 


This means that the optimal set of goods for each respective set of 
prices cannot cost more than the other set of goods (optimal for a 


different set of prices) 

Rewriting the equations, wc get 


£ p.°(*.° - *.') ^ o, 

(67) 

£ m*. 1 - *.°) - 0 

(68) 

Add these two equations to get 


£ (p: - P.°K*. 1 - *.°) - o 

(69) 


If the two equilibrium points are assumed to be always distinct, 
and if a proper absolute minimum is assumed to be always realized, 
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then the equality sign may be dropped, and this may be rewritten 

E {Pi 1 - P<°) (x,- 1 - xS) < 0, (70) 

This may also be written 

n 

Z ApiAxi < 0, not all Ap { = 0. (71) 

Suppose we allow but one price to change, say the &th ; then 
all but one term of (71) vanishes and we have 

Axi : Apk < 0; (k = 1, •••,») (72) 

i.e., as the htli price increases, all other prices being held constant, less 
•mill be bought of the ktk good. It must be emphasized that this 
holds only for a movement along the same indifference locus, i.e., 
for a compensated change in price, and does not mean that with a 
given money income a change in one price will necessarily result in a 
decreased amount taken of the corresponding commodity. It will 
be noted that the above proof does not involve the calculus at all; 
using only the operations of addition and subtraction, the definition 
of a maximum position may be utilized to derive meaningful finite 
demand restrictions. 

Employing only the most elementary logical and arithmetical 
operations, we can advance matters still further. Consider any 
initial set of prices and income {p i°, • • •, p n °, I 0 )- Corresponding 
to this there will exist one or more optimal sets of goods. Select 
one of these and designate it by (xf, • • x n ° ). Consider now a 
second set of prices and income {pi, ■ • • , pn, I 1 ), and a correspond- 
ing optimal set of goods (rei 1 , • • •, xf). 

Let us consider what would have been the cost of the second 
batch of goods at the prices of the first. This will be 

PiW + ptW + - • • + pM = E piW. (73) 

1 

If this cost is equal to or less than the amount of money that the 
first batch actually cost, we have conclusive evidence that the 
second batch is not higher on the individual’s preference scale than 
the first batch; for if it were, the individual could not have been 
in equilibrium in the first place, since he would not be minimizing 
total expenditure for the attained level of satisfaction. In other 
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words, if he could have bought the second batch, and he bought 
the first, we rule out the possibility that he prefers the second to 
the first. 

Our theorem is 

t pM S £ PM implies Fl<p{X')] 5 ( 74 ) 

More specifically, 

£ PM < £ pM implies /'OCA'*)] < (75) 

Similarly, 

£ PM < £ pM would imply f[p(X 0 )J < /£*(**)]. (76) 
It is obvious that 

£ PM < Z PM, (77) 

t 1 

and 

£pM< ZPM (78) 

1 1 

cannot both hold simultaneously, for this would imply 

FM*)] < J 09) 

and 

FIX* 1 )] > W)} ( 8 °) 

which is a contradiction. 

This gives us a condition which holds for any movements, not 
merely for compensated ones. 

Equations (77) and (78) may be written 

£ p,“(r.' - x,”) < 0 implies £ - x,V < 0, TO 

1 1 

hr 

£ PAx, < 0 implies 23 ( P . *f Ap,)Ax, < 0. (82) 


If we assume that our demand functions are single-valued, and 
-x.e to consider only distinct points, this may be broadened to 
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the following form : 

» rt 

E Pitei = 0 implies £ (£ t - + Ap t )Ax { < 0. (83) 

The importance of this result can hardly be overemphasized. 
In this simple formula are contained almost all the meaningful 
empirical implications of the whole pure theory of consumer’s 
choice. Moreover, these are expressed in the form which is most 
suitable for empirical verification. So fundamental is this condi- 
tion that (as I have shown elsewhere) it provides a foundation for 
the theory of economic index numbers and the utility analysis, and 
affords the most convenient path for the derivation of all known 
restrictions upon the individual and general demand functions. 14 

From this condition alone can be derived the following restric- 
tions upon the demand functions: 

(a) They must be single-valued; i.e., to each set of prices and 
income there corresponds a unique set of goods. 

(b) They must be homogeneous of order zero; i.e., a change in 
all prices and income in the same proportion must leave all quan- 
tities unchanged. All the properties of condition I of the previous 
section must therefore hold. 

(c) All known valid qualitative restrictions upon the slopes of 
the demand functions. These will be indicated below. 

Elsewhere 15 I have suggested as new foundations for the pure 
theory of consumer’s behavior the conditions (a) and (b) and 
equations (83). At that time I did not realize that (a) and (b) 
were redundant in the sense that they themselves could be deduced 
as theorems from the assumption of (83) alone. In other words, 
this single condition provides us with complete foundations for the 
theory (with the reservation concerning integrability). 

The proof of (a) and (b) as theorems can be indicated simul- 
taneously. Consider an initial price and income situation (pi , • • • , 
Pn°,I°). Corresponding there is a set of goods (xi°, •••,#„). 
Suppose now all prices and income to be multiplied by the same 

14 The only point upon which this formulation does not throw light is that of in 
tegrability. Even here, a proof may still be forthcoming by which this condition may 
be slightly generalized to include the question of integrability. 

16 “A Note on the Pure Theory of Consumer’s Behavior, Economica, e ruarj , 
1938, pp. 61-71. 
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positive quantity, m, {vipS, , mp,°, mP) Corresponding there 
is a second set of quantities (x, 1 , ,x„‘) We wish to pros e that 

the second batch of goods is identical, commodity for commodity 
with the first 


By hypothesis, 


Hence, 

I 1 = mP 

(84) 


Z P'Xx 1 = m £ p t °x t a 
i i 

(85) 

Also, 

Therefore, 

P, 1 = mp (i = 1, , n) 

(86) 


i P°x' - £ P.V 

(8?) 

But also 

i p.v « i p.w 

i l 

(88) 

But this is a contradiction, for condition (83) says that 


ip. 

W = £ pi°x x ° implies £ p y l xf > £ pxW 

i i i 

(89) 

Therefore, these cannot be two distinct points Hence, 



= x,° (i = 1, , ») 

(90) 

For m = 1 , condition, (a) is deduced as a special case of (b) 


Thus far, we have made no use of the calculus We may now 
proceed to derive conditions upon the various partial derivatives 
of our demand functions 

Let us go through a limiting process and write (83) in the 
following differential form 


£ dp4x, < 0, 

(91) 


for 


£ pidx t « 0, 


lot alt dx x or dp x ** 0 



THEORY OF CONSUMER'S BEHAVIOR 113 

In this expression the dx’s and the dp's are differentials, not 
infinitesimal increments. 

Regarding prices and income as independent variables, from 
our demand functions of (20) we have 

n 

dxi = 22 hj'dpj + hpdl. (i = !,•••,«) (92) 


But for 22 p,dxj — 0, 

1 

dl - £ Pjdxj + £ Xjdpj = £ Xjdpj. (93) 

1 1 1 

Therefore, 

dx x = 22 (h/ + x,ht l )dp , = 22 K tj dpj. (94) 

1 1 

Equation (91) becomes 

22 £ ( h *■ + x jhi‘)dpidp, s 0, (95) 

1 1 

or 


£ £ (K,j)dpidpj ^ 0. (96) 

1 1 

This is a negative semi-definite form ; semi-definite because for all 
prices changing in the same proportion, it vanishes due to the 
homogeneity condition. 

This is also derivable in at least two other ways. From 
equation (71) 

£ ApAxi ^ 0 along an indifference locus (97) 
1 

1 his may be written 

n 

£ dpidx t < 0 along an indifference locus. 

1 

(not all differentials vanishing). (98) 
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But along an indifference locus, from (43) 

x % ~ •••,/>», F(<p)2> (* = 1 ,■••,») ( 99 ) 

where i£* is homogeneous of order zero in the £’&. Also 

dx x - £ f/dpj. (i = 1, . • • , n) (100) 

Therefore, 

£ £ tpj'dpidpj S 0 . ( 101 ) 

i i 

But, of course, for a movement along an indifference locus, i e., 
for a compensated price change, 

J£ ( W2 ) 

Therefore, (101) may be written. 

£ £ K X) dpjdp t 2 0. (103) 

i i 

Finally, in the Mathematical Appendix the following algebraic 
theorem is stated : Let 

= [^] - 04,.], (i,j- l. •.») CM) 

be the first n by n matrix of the inverse matrix of [Pi Then 

£ £ A i} hJh S 0 (1°5) 

i i 

because [£>] is the matrix of a negative definite form under con- 
straint. Now from (38) 

K ti = (- \)A Jt . ( I06 > 

Since (—X) >0, our theorem again follows. This is a direct 
algebraic proof of our theorem. 

The meaning of the requirement that the form in (96) be nega- 
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tive semi-definite may be briefly indicated. 16 

Let 


K x i K lt ■■■ 

Ki n 


K Z1 K 22 • • • 

k 2 „ 

K - \K„ \ = 

• 

• 


K~ni K. n2 • - . 

K nn 


Then because of the semi-definiteness 

\K\ = 0 and £ £ Rtf#, = 0. 


US 


(107) 


i i 


(108) 


However, the principal minors beginning with the first alternate in 
sign from negative to positive, i.e., 


\K U \< 0 ; 


Ku Ki2 

K n K-22 


> 0 ; 


Kn 

K n 

K 

K n 

K 22 

K, 

K S1 

k 32 

K 


< 0, etc. (109) 


Thus, the following demand restrictions are implied. 

(* = 1, •••,») (HO) 

\dpi + x ' dl ) \dpj + X] dl ) \dp, + 3 dl ) > u ’ 

(i,j= 1, — , n), (i j), etc. (Ill) 


3xj . dX j 
dpi + x i-gJ < °> 


Condition (110) was first developed by W. E. Johnson and 
Eugen Slutsky, presumably independently. We see that it is not 
possible to deduce that 

|£< 0, (112) 

dpi 

the ordinary expression for the “law of demand.” For if dXi/dl 
be sufficiently negative, dxi/ dpi can be algebraically positive. This 
is the phenomenon alluded to in the well-known Giffen’s Paradox. 17 

16 If integrability is not assumed and Ka ^ Ka, the exposition may be easily 
modified by the substitution throughout of the term (it,-,- + Kj;)/2. 

17 It is only by making additional, and demonstrably arbitrary, assumptions that 
various writers have been able to derive the so-called law of diminishing demand. 
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The assumption that the form in (96) be symmetrical and nega- 
tive semi-definite completely exhausts the empirical implications 
of the utility analysis All other demand restrictions can be de- 
rived as theorems from this single assumption These arc bold 
statements, but they are substantiated by the fact that it is possible 
to work backwards from the assumption of (96) to an integrate 
preference field displaying the properties necessary for a maximum n 

Conclusion 

We have come a long way in this chapter. Despite its lofty 
beginnings, the pure theory of consumer’s behavior, when its em- 
pirical meaning is finally distilled from it, turns out to be one simple 
hypothesis on price and quantity behavior. This may be written 


for 


Z < 0 , 


£ S 0, not all Ar< =* 0, 


subject to the qualifications indicated above. Alternatively, we 
may u rite this as 

11 Only a sketch of the proof of this statement need be given Write 
Xl ■» jf'(ai, 6*1. >*) 

dn 9x l ,{ dx ' , -A 

Ku ~dZ+* > di~ \3 a , r/ r*» ' 


Define a new set of variables 
fit " /‘(“i- 


* - . ft.) 


Xi-WiPtf „ ,/?.), .«.)3-6W. tfi& 


Then there extsls a function 


.ft.) 


v ** 

• which satisfies the properties of our preference field 
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where the equality sign holds only for all prices changing in the 
same proportion. 

Many writers have held the utility analysis to be an integral 
and important part of economic theory. Some have even sought 
to employ its applicability as a test criterion by which economics 
might be separated from the other social sciences. Nevertheless, 
I wonder how much economic theory would be changed if either 
of the two conditions above were found to be empirically untrue. 
I suspect, very little. 

A Note on the Demand for Money 

One special problem in the theory of value has been touched 
upon at various places in the previous discussions, namely, the 
value of money itself. Probably more has been written upon this 
subject than upon any other in economics, and most of the issues 
raised are not germane to the present investigation. However, it 
is a fair question as to the relationship between the demand for 
money and the ordinal preference fields met in utility theory. In 
this connection, I have reference to none of the tenuous concepts 
of money, as a numeraire commodity, or as a composite commodity, 
but to money proper, the distinguishing features of which are its 
indirect usefulness, not for its own sake but for what it can buy, 
its conventional acceptability, its not being “used up" by use, 
etc., etc. 

The most interesting problems arising in connection with money 
are linked up with the fact of “uncertainty” in the most general 
sense, leading into liquidity considerations which cannot be dis- 
cussed here. However, it should be possible to indicate in a few 
pages the way out of certain false dilemmas connected with the 
demand for money for so-called transactions purposes . 19 

On the one hand, there are writers like Mises who would explain 
the value of money in marginal utility terms such as might be 
applicable to any commodity; on the other hand, to economists 
like Schumpeter the peculiarity of money lies in its essential lack of 
direct usefulness and in the fact that its value is not capable of 
explanation in the usual utility terms. On the whole the latter 

13 For a summary of the extensive Continental speculations on these matters see 
Howard S. Ellis, German Monetary Theory , 1905-1933 (Cambridge, Mass., 1934), art 
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view is the least misleading, but, as Walras showed many years 
ago, it is possible to modify utility analysis so as to take account 
of the peculiar properties of money The latter, who above all 
others developed the notion of general equilibrium in which a!! 
magnitudes are simultaneously determined by efficacious inter 
dependent relations was able to remain undisturbed by the fears 
of literary writers that there was something viciously circular m 
assuming the existence of prices and of a "value for money in the 
midst of the process by which that value was to be determined 
Today after the recent contributions of Keynes it is particularly 
rewarding to go back to re&xamine the elaborate discussion of 
liquidity preference, encaisse destree, etc , in Walras The latter 
was so sophisticated as to have outgrown the quantity equation in 
later editions of his work, although he continued to believe in what 
is today called the "quantity theory”, very rightly, in my opinion 
he in effect reversed the commonly met dictum that "the quantity 
theory should be scrapped but the quantity equation is useful ” 
Here I shall only consider the demand for the holding of money 
by the consumer As before, ordinal utility or preference depends 
upon all commodities but the (« + l)th good, M, will be taken to 
be money, which yields benefit only in its ultimately being given 
up Possession of an average amount of it yields convenience m 
permitting the consumer to take advantage of offers of sale, in 
facilitating exchanges, in bridging the gap between receipt of in 
come and expenditure, etc The average balance is both used and 
at the same time not used , it revolves but is not depleted , its just 
being there to meet contingencies is valuable even if the con 
tingencies do not materialize, ex post Possession of this balance 
then yields a real service, which can be compared with the direct 
utilities from the consumption of sugar, tobacco, etc , in the sense 
that there is some margin at which the individual would be in 
different between having more tobacco and Jess ol a casn oaianct, 
with all of the inconvenience which the latter condition implies 
But there is this difference Given physical amounts of to 
bacco, food, ballet, etc have significance in terms of the want 
pattern of the consumer, but it is not possible to attach similar 
significance to a given number of physical units of money, say to 
a number of ounces of gold It would be otherwise in the case of 
^old which was to be used to fill teeth, but such uses of money in 
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the industrial arts we purposely neglect. The amount of money 
which is needed depends upon the work that is to be done, which 
in turn depends upon the prices of all goods in terms of gold. 

The above remarks are by now so well known as to appear trite 
and trivial. But let us translate them into mathematics. Our 
ordinal utility is now a function, not alone of the physical quan- 
tities of goods, but it has within it prices as well. This is a serious, 
significant alteration, for, as we shall see, the empirical properties 
of the demand functions are changed by this innovation. This is 
not the only case in which economists have found it necessary to 
introduce prices into the indifference loci; there is also the example 
of goods which have snob appeal, or scarcity appeal, which are 
valued for their exclusiveness so that preference for them is altered 
by changes in their relative prices. This Veblenesque effect need 
not detain us here. 

Our utility function will be of the form 


U(Xi, • , X n , Mp m , p\i ' * , pn) 

— U (x 1, *, x ni RT\p m , \ph 


■APn) 


- F\^x 1 , •••,*., pi , Pn )> 


( 113 ) 


where the function is homogeneous of order zero in all of the prices, 
so that a doubling of all prices (including the price of gold) at the 
same time that quantities are unchanged will leave ordinal utility 
unchanged. This stems from the hypothesis, which need not be 
true in the short run or under particular expectations, that money 
is evaluated only in terms of the work which it has to do. It will 
be noted that I have not set the price of money equal to one. 
Actually, we shall avoid confusion in thought if we refrain from 
doing so. Any other commodity may be used as numeraire, or we 
may express prices in any units we like. Of course, it will still be 
possible to speak of the gold price of things, and after we have 
learned to do without the use of the money commodity gold as 
numeraire, we can then fall back upon the simple convention of 
expressing prices in terms of it. Of course no reader will think 
that I attach any particular importance to gold or any other metal ; 
any conventional unit which serves as money will do. 
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Walras was careful to point out another important distinction 
consumption of goods is a flow per unit time, so much tobacco per 
year etc , but the gold balance is an inventory or stock We may 
speak of its price in two senses, the price of gold compared to the 
prices of other things as one ounce of gold is worth two beaver 
skms while a pound of tobacco is worth three beaver skins, or an 
ounce of gold is worth two ‘ anythmgs while a pound of tobacco 
is worth three ‘anythmgs In our notation this is p m But we 
may also speak of the price of the use of gold per unit time In a 
capital market where people can both borrow and lend at a gi\en 
rate of interest, this price is necessarily related to the rate of in 
terest This is true even if the individual in question does not 
have to stay in debt for the amount of his cash balance, in any case 
there is the opportunity cost of holding money in the sense of the 
interest he might have earned by lending this sum 

If we wish to exclude dynamical considerations from our dis 
cussion as much as possible, the simplest assumption would seem 
to be that the individual maximizes the above expression subject 
to the following budget equation 


i P,x, + rp m M = / (114) 

i 


where prices, income, and interest r are given to the individual 
Thus, the amounts paid out (or foregone) in every period for the 
use of money are treated as subtractions from income available for 
expenditure upon consumer’s goods The conditions of equi 
librium are exactly as in chapter v equation (29), except that the 
marginal utilities of the goods are affected by the level of prices 
directly, and we now have an additional unknown, M, to be de 
termmed But we also have an additional equation 




(115) 


Thus, the use of money is bought up to the point where its marginal 
utility (convenience) is proportional to its cost, or the interest 
charges which must be paid for its use The secondary inequalities 
are exactly as in the usual case i e , the principal minors of the 
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bordered Hessian must oscillate in sign, with money treated as an 
(n + l)th good. 

From the conditions of equilibrium our demand curves are as 
follows, 

*x- = k*(Pu ■ • ■ » pn, pm, I, r), (i = 1 , • • - , n) ... .. 

M — M(pi, — , pn, pm, I, r). ( 116 ) 

These equations are homogeneous of order zero in all prices and 
income just as in the usual case of demand. With the interest 
rate unchanged, doubling all prices (including the price of gold) 
and income will leave all quantities unchanged. However, if we 
double all prices and income in terms of the price of gold, then 
with r unchanged, the demand for physical quantities of money 
will be exactly doubled. Mathematically M is a homogeneous 
function of order one in terms of all other prices and income. This 
is the peculiar feature of money. Another way of saying the same 
thing is the assertion that the demand for money in terms of its 
own price, all other prices held constant, is necessarily of unitary 
elasticity. This is a familiar proposition in the history of the 
quantity theory. The reader will note that this is a meaningful, 
refutable hypothesis which is capable of being tested under ideal 
observational conditions . 20 

The further qualitative properties of the demand schedules can 
be determined by differentiation of our equilibrium equations. 
However, they will not be so simple as those in the usual demand 
case, because in varying any one price, we shift every equilibrium 
equation through the direct influence of each price on marginal 
utilities. Still, income elasticities behave more or less the same as 
before; also the demand for money in terms of the interest rate 
behaves just like the demand for any good in terms of its own price 
under the usual utility assumptions. Thus, if interest is raised, 
and at the same time income is increased so as to leave the indi- 
vidual equally well off, then the amount of mone3 r demanded must 
fall. If money is not an inferior good, and we should expect it not 
to be, then an increase in interest will lower the amount of money 
demanded even for an uncompensated change. 

20 Jacob Marschak, “Money Illusion and Demand Analysis, 1 Review of Economic 
Statistics, February, 1943, pp. 40-48. 
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The complete results of such differentiations are given below 
in partitioned matrix form a 


A/ h Pn ' hr hr 


\Mj M Pa Mr Mr 


JT dU % dU t dU. 

u " W p - af +u •• if. 0 0 


= -\dU, a 2 u 


aw aw 


\dM dM* p ”\ map, maf+ yr 0 ^ 


rM -1 p m M\ 


We may leave as an exercise to the interested reader to verify 
this and to work out the interesting and special case where the 
ordinal preference field takes the simple form U(xi x n p n M/I) 


Qualifications Introduced by Uncertainty 
The above analysis of the demand for money confines itself to 
the so called transaction aspects The really interesting problems 
arise when uncertainty as well as transaction friction is admitted 
into the picture Some of the best lines ever written in this con 
nection are those to be found in Keynes’s General Theory and in 
Hicks’s Value and Capital Space can be devoted to only one com 
ment here In the past the structure of interest rates and assets 
has been subject to onfy perfunctory notice so that the recent 
analysis m terms of liquidity preference is of the greatest im 
portance But it would be a mistake which would be as damaging 
to further analysis of liquidity preference as it would be to classical 
doctrines, if it were thought that uncertainty and liquidity differ 
entials are the sine qua non for the existence of a rate of interest 

11 Reference may be made here to the valuable contribution ofC E V Leser The 
jmer s Demand for Money Eccmometrtca vol XI no 2 (April 1943) pp 123-14 
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Such a view can be compared with a theory of land rent based upon 
differences in the quality of different kinds of land. I believe that 
the analogy is not a superficial one. 

Specifically I think it would be misleading to say that interest 
is simply the price of money; it would be quite possible to have an 
economy in which money did not exist, and in which there still was 
a substantial rate of interest. 22 Nor can I agree with all of the 
formulations of Hicks’s brilliant chapter xiii on Interest and Money, 
such as “. . . it must be the trouble of making transactions which 
explains the short rate of interest” (p. 165);” ... if bills are 
are perfectly safe, [there would be] no reason for the bills to fall to a 
discount” (p. 165); “Securities which are not generally acceptable 
in payment of debts bear some interest because they are imper- 
fectly ‘money’ ”(p. 167). 

It is true that in a world involving no transaction friction and 
no uncertainty, there would be no reason for a spread between the 
yield on any two assets, and hence there would be no difference in 
the yield on money and on securities. Hicks concludes, therefore, 
that securities will not bear interest but will accommodate them- 
selves to the yield on money. It is equally possible and more 
illuminating to suppose that under these conditions money adjusts 
itself to the yield of securities. In fact, in such a world securities 
themselves would circulate as money and be acceptable in trans- 
actions; demand bank deposits would bear interest, just as they 
often did in this country in the period of the twenties. 23 And if 
money could not make the adjustment, as in the case of metal 
counters which Aristotle tells us are barren, it would pass out of 
use, wither away and die, become a free good. 

In the preface to the reissue of Risk, Uncertainty and Profit 24 
Frank Knight makes the penetrating observation that under the 
conditions envisaged above the velocity of circulation would be- 
come infinite and so would the price level. This is perhaps an 

22 Elsewhere I have developed this notion briefly under the title The Rate of 
Interest under Ideal Conditions,” Quarterly Journal of Economics, L 1 1 1 (February, 
1939), 286-297. 

23 In a world without uncertainty where money has the same yield as other assets, 
its velocity becomes indeterminate. Note that this is the case when the interest rate 
is zero. 

2i Risk, Uncertainty and Profit (London School of Economics and Political Science. 
Series of Reprints of Scarce Tracts, no. 16, 1933), p. xxii. 
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over dramatic way of saying that nobody would hold money and 
it would become a free good to go into the category of shells and 
other things which once served as money We should expect too 
that it would not only pass out of circulation but it would cease to 
be used as a conventional numeraire in terms of which prices are 
evpressed Interest bearing money would emerge 

Of course the above does not happen in real life precisely be 
cause uncertainty contingency needs non synchronization of 
revenues and outlay transaction frictions etc etc all are with us 
But the abstract special case analyzed above should warn us against 
the facile assumption that the average levels of the structure of 
interest rates are determined solely or primarily by these differ 
ential factors At times they are primary and at other times such 
as the twenties m this country they may not be As a generahza 
tion I should hazard the hypothesis that they are likely to be of 
great importance m an economy in which there is a quasi zero 
rate of interest I think by this hypothesis one can explatn many 
of the anomalies of the United States money market in the thirties 
A further defect of the reasoning that I have been criticizing 1 es 
in the tendency to regard as universal the hypothesis that interest 
yield is inversely correlated with an asset s nearness to money so 
that normally long rates are above short rates This is not m 
accord with much of economic history for reasons which I do not 
think are difficult to elucidate “ 

*» See D Durand Bas c V elds of Corporate Bonds 1900-1942 Techn cal Paper 3 
(New York Nat anal Bureau of Econom c Research 1942) Also F A Lutz e 
Structure of Interest Rates Quarterly Journal of Economics LV (1940) 36-63 
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TRANSFORMATIONS, COMPOSITE COMMODITIES, 
AND RATIONING 

Logarithmic Transformations and Elasticities 

Through the influence of Alfred Marshall economists have de- 
veloped a fondness for certain dimensionless expressions called 
elasticity coefficients. On the whole, it appears that their impor- 
tance is not very great except possibly as mental exercises for be- 
ginning students . 1 As we have seen, most of the “laws of eco- 
nomics” are qualitative and ordinal rather than quantitative, and 
where such are concerned the problem of dimensions is of no 
consequence. 

Furthermore, while elasticity expressions are invariant under 
changes of scale, they are not invariant under changes of origin. 
Since there are no natural zeros from which we measure economic 
magnitudes, the elasticity expressions can be seen to be essentially 
arbitrary. Thus we meet in economic analysis such concepts as 
exports, net purchases, amount of factors supplied, etc., all of 
which are differences measured from arbitrary bases . 2 

Mathematically an elasticity expression between two magni- 
tudes, such as price and quantity, consists simply of the logarithm 
of one of these quantities differentiated with respect to the log- 
arithm of the other. Thus, 

Vz P = ( dx/dp)(p/x ) = d log x/ d log p = Ex/Ep . 3 

1 There is perhaps some usefulness of the concept of elasticity of demand as giving 
an indication of the qualitative behavior of total revenue, but even this is only the 
consequence of neglecting to deal with total revenue directly. 

2 This was realized, somewhat inadequately, by Wicksteed who denied the validity 
of the concept of supply, preferring to use the notion of “reserve demand’’; only he 
went to the opposite extreme in attaching sacredness to his reformulation. 

3 A calculus of operations with elasticity terms can be devised on the basis of the 
properties of logarithmic .derivatives. See R. G. D. Allen, Mathematical Analysis for 
Economists (London, 1937), chap, x, particularly the reference there to the work of 
D. G. Champernowne. 
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Not only are elasticity expressions more or less useless, but in 
more complicated systems they become an actual nuisance, 4 con- 
verting symmetrical expressions into asymmetrical ones, and hiding 
the definiteness of quadratic forms. This could be illustrated by 
utility analysis, but can be amply demonstrated by the slightly 
simpler case of profit maximization, where constraints do not enter 
the picture. Let us consider a firm buying inputs («,, * . ,v n ) in 
perfectly competitive markets at given prices (zz> lp • • , w n ) With 
the demand curve for its product and its production function being 
known, total revenue becomes a determinate function of the inputs 
bought. Total expenditure being defined as the sum of the 
amounts paid to all the factors of production, it is clear that the 
hrm's profit can be written in the form 


v = R( vt, • , v„) - £ iVjVj. 


(1) 


For profits to be at a regular maximum, we must have 

j?,(iij, > O — = 0, (z — 1, , n) 

[7?„3 negative definite. (2) 

Suppose we are interested in the inverse demand functions 
giving the amount of each factor of production which will be bought 
at each set of prices. By explicit differentiation of our equilibrium 
conditions above, we find 

£R ti r i . w 

Since the inverse of a symmetrical negative definite matrix is itself 

* Actually it is a little misleading to say that an elasticity expression is necessarily 
“without dimension " For take any absolute derivative , such as dx/dp, which is cer- 
tainly not dimensionless, involving as it does the dimensions [output times output divide" 
by value} Even though it has dimensions, it is still the elasticity of some expression 
Thus if 

* = HP), 

and 

y « *(«>. 

where 

y *= 1 = 


F— 1 = 

Idw,] 


then 
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symmetrical and negative definite, the complete conditions on the 
demand functions are summarized in the above expression. 

If one were interested in the corresponding elasticity coefficients 
[Evi/Ewf], one could as an afterthought introduce suitable factors 
into the matrix of equation ( 3 ). This usual procedure has been 
termed by Professor Lange the “ indirect ” method . 5 He suggests 
an alternative “direct” method. I should like to point out a third 
method, which carries out the Lange procedure to its logical con- 
clusion. Before doing so, however, I should like to express the 
opinion that in this matter “indirectness” is a virtue rather than 
a drawback. 

It will simplify the discussion if we adopt a mnemonic notation 
whereby the Jacobian of one set of variables with respect to another 
is written in a form reminiscent of that for a single derivative; i.e., 


J bu 1 , • • •. *») = [|r] = %' ^ 

The reader may then verify the identities 

dzdy _ dz 
dydx ~ dx ’ 

dy ~ \_dx\ 

which parallel exactly those for single derivatives. 

In terms of this notation the Jacobians of the transformations 


Vi = log Vi, Vi = e v< 

Wi = log w u Wi = e Wi 


{i = 1, •••,») 


( 6 ) 


can be written as 



where 5, 7 is the Kronecker delta, equal to one for like subscripts 
and vanishing for all others. 

6 Oscar Lange, “Theoretical Derivation of Elasticities of Demand and Supply: The 
Direct Method,” Econometrica, X (1942), 193-214. 
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Then it follows that 


dV 

dW 


dV dv du 
dv dwdW 




(8) 


This is the so called indirect method The Lange direct method 
involves computing the same matrix by the following equality 


dV_ 

d\V 


f dWT-< 

’ v,R t) * ~ J 

Ydv\ - 

w t 


There seems to be no special advantage in this procedure on 
the other hand there is the great disadvantage of loss of symmetry 
prior to inversion and a cloaking of the definiteness properties of 
R and its inverse Of course, what is co\ ered up can later be 
unscrambled, but a good deal of waste motion would seem to be 
involved 

However if we wish to carry the direct method to its final con 
elusion it would seem logical to replace the k’s and ws in the 
original profit expression by their equivalences in terms of the 
variables V and TV With the latter being given, the firm would 
vary the V’s so as to maximize profits The conditions of equi 
librium would be 


S t (V„ Vn) - eW = 0, (t = 1, , n) (10) 

and 

- vfiv.6,,1 = H 

negative definite In the above expressions it is to be understood 
th lt S(V lt ,V n )=R(e v \ , «*•), (11) 


and that subscripts stand as usual for differentiation 

If we differentiate our equilibrium conditions we easily derive 
the identity 

( 12 ) 


where H~ l is symmetrical and negative definite Thus except for 
column factors the elasticity matrix is symmetrical By dividing 
through with these factors and defining K t = vnv,fL we 
easily arrive at the familiar identity * 

K{q, } = KjT}}, 

■ H Schultz Theory and Measurement of Demand (Chicago 19J8) chap xix 


(13) 



TRANSFORMATIONS, COMMODITIES, RATIONING 129 
Also we may verify the relations 


Tja 0 , 


Va Vij 
Vji Vi) 


> 0, etc. 


(14) 


Relations (13) and (14) are also immediately derivable from the 
indirect method, but not easily from the Lange method. 

If one must use “dimensionless” expressions, it would seem de- 
sirable to replace the usual elasticity expression by the more sym- 
metrical coefficient 


Mu 


dVj V WiWj 

d w i V ViVj 


(15) 


The reader can easily verify that symmetry and definiteness are 
preserved in all matrices, that these coefficients are pure numbers, 
which in the case of diagonal elements reduce to the old definition 
of elasticity. 

I should like to conclude this discussion of elasticity expressions 
with a warning that the transformations break down when any of 
the variables are zero or change sign. In earlier sections it has 
been shown that productive services or inputs can be regarded 
simply as negative outputs. These can be converted into log- 
arithms only after a reversal of sign. This would offer no problems 
were it not for the fact that many of the magnitudes are neither 
always positive nor always negative. As an illustration of the 
difficulties which may arise, consider the classical case where the 
total of all expenditure is taken as zero. Here the partial deriva- 
tives of the various goods with respect to hypothetical income 
changes have a definite meaning, whereas their elasticities cannot 
be defined. 


General Transformation of Independent Variables 

The logarithmic transformation discussed in the previous sec- 
tion is but one special case of the transformation of our inde- 
pendent variables. It is desirable to examine the behavior of our 
equilibrium conditions under general transformation of our inde- 
pendent variables because in economics as in other disciplines we 
wish to free ourselves from any particular scheme of reference, or 
coordinate space, in favor of generalized coordinates. This desire 
is not only aesthetic but is necessitated by the fact that in real life 
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\\ e do not always meet naturally defined commodities In business 
cycle theory we often use the concept of "the cost of living ” taken 
to be the pace of some composite market basket of commodities 
But even if we confine our attention to what is ordinarily called a 
commodity, such as “wheat,” we find ourselves dealing with a 
composite commodity made up of winter wheat spring wheat of 
varying grades Each of these in turn is a composite of hetero 
geneous components, and so forth in infinite regression 

There is a related reason for analyzing the effects of a general 
transformation Among the many recent advances contained in 
Professor J R Hicks’s Value and Capital perhaps the most impor 
tant from an analytical point of view was the enunciation of the 
principle that a group of commodities has the property of a single 
commodity if their prices all change m the same proportion This 
theorem had been partially anticipated by Professor Leontief 7 but 
it was Professor Hicks who made it the cornerstone of his exposi 
tion E\ery mathematical economist, working in many variables 
finds difficulty m explaining his theories in words or diagrams By 
means of the Hicks Leontief theorem all but one of the variables 
can be lumped together as a single \ ariable, and a tolerably 
simple literary explanation cap be phrased We shall see how this 
theorem emerges as a special case of general theorems concerning 
transformations 

The most general conditions of equilibrium are of the type first 
encountered m chapter 11 equation (1), which we may rewrite in 
matrix form with suppressed subscripts as 

f(x, a) = 0 ( 16 ) 


If we subject the independent variables to the non singular 
transformation 

* = rw, a?) 

whose Jacobian 

t = PV3 


never vanishes then our equilibrium conditions become 

jinx) <o -/(*,«) = 0 a*) 

7 VV Leontief Composite Coraraodites and the Problem of Index Numbers 
itnca vol IV (1936) 
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If the relationship between the equilibrium values of our un- 
knowns and the parameters was given by 


x = g(a), 

the new relationship will be 

x = Q( a ), 

which will be equivalent to 

T(x) = g(a). 

Most importantly, it can be easily verified that 

dx _ dx dx 
da dx da 


(19) 

( 20 ) 
( 21 ) 

( 22 ) 


We may turn directly to the general extremum problem where 
some quantity is to be at a relative maximum or minimum with 
respect to n independent variables, which are themselves under 
m constraints. 8 In matrix notation we have 

2 = f(x) (23) 

G(x) = 0. 

At a regular maximum it is necessary that the matrix 

Cf„ GJ (24) 

be of rank m, and that 

h'H xz h (25) 

be negative definite subject to 

h'G x = 0. 

where 

H„ = tUr, + £ < 26 > 

1 

where the X’s are Lagrangean multipliers. 

After transformation we have 

U-„ &J - [!J I/.. GJ. (27) 


* See Mathematical Appendix A, Section III. 
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where prime on a matrix £a indicates its transpose [a,1 
Since the first matrix on the right hand side is non singular the 
first order rank conditions are invariant under transformation It 
is also easy to show that 

k Rxifc — WlhJi (2S) 

where 

hG z s h'G, m 0 

so that the second order conditions are invariant under trans 
formation 

Alternatively this might have been proved by showing the in 
variance of the relevant bordered determinants m consequence of 
the relation 



It will be noted that the Lagrangean multipliers (Xi Am) 
are invariant under transformations of independent v ambles 
There are economic as well as mathematical explanations of this 
If we insert parameters (ai a n ) in our equations above so that 

our constraints become 

G k (x) - a* = 0 (30) 

then one can easily verify b> direct differentiation that 


|>-1 -[-*.] ( 3 » 

\_acck Jo her os co Jnt 

Earlier we have derived special cases of this in connection with the 
marginal utility of income (chapter v p 100) and marginal cost 
(chapter iv p 65) 

The above relationship can be derived heuristically by differ 
entiating II with respect to holding ill other variables constant 
As with other operations involving Logrint,can multipliers this i» 
in analytic trick whose justification lies in its equivalence with 
expressions which cm be rigorously derived by more round ibout 
methods 
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Explicit attention is called to the fact that transformation of 
the independent variables of an extremum problem subjects the 
matrix of the relevant quadratic forms to a cogredient congruent 
transformation which leaves the definiteness of the quadratic form 
unchanged, and also leaves the signs of principal minors of a given 
order intact. This is in consequence of the equivalence of our 
equilibrium equations to an extremum condition, and need not be 
true for other types of equilibrium. 

Thus, when Hicks 9 sets down as a condition of perfect stability 
under general market exchange the requirement that all the 
principal minors of a matrix 



be positive, we have as yet no way of knowing whether or not this 
is independent of the particular coordinate system chosen. The 
same may be said of his concept of “imperfect stability.” The 
reason for the ambiguity lies in the inapplicability of the proofs 
given above to asymmetrical matrices. 10 

This section has dealt with the invariance of an extremum posi- 
tion under transformation of independent variables. We turn now 
to the problem of transformation of the dependent variable. 

Transformation of Dependent Variable 

It is by now well known that the equations of demand are 
independent of the particular choice of utility index; 11 i.e., all of 
their properties are invariant under a general transformation 

U = F{<p), ( 32 ) 

where the latter is a one to one, always increasing function. For 
simplicity we may assume that it is twice differentiable. 

In the most general case where f(x) is to be made a maximum 
subject to one or more constraints on the x’s given by 

G(x) = 0, (33) 

* J. R. Hicks, Value and Capital (London, 1939), chap, v and Mathematical Ap- 
pendix, pp. 315-317. 

10 This problem is discussed again later in this chapter. 

11 See chap, v, pp. 94, 99, and 104. 
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the equilibrium conditions are completely expressed by the condi 
tion that be of certain rank, and that the quadratic form 

h'H Ix h be of certain definiteness for values of k satisfying 

h'G s = 0 12 (34) 

Now if we subject z and the G’s to the transformation 

F = F(z) and B = E(G), (3a) 

and ask that the new variable F be maximized subject to the nev, 
constraints our conditions involve the rank of [_F X E Z ~] and the 
definiteness of h'R xx h, subject to 

h'E t = 0, (36) 

where 

= [F,, + Z (37) 

i 

By actual differentiation we easily find the relationships between 
the new and old matrices, namely 

[F,&] = (y,G,][ ^ £ ,° G) ] (38) 

Since the last of these matrices is a diagonal matrix which is to be 
non singular, the first order rank properties are clearly invariant 
under this transformation, as is the actual solution m x itself 
We may similarly verify by differentiation that 

h'RJi = h'lF'f^ + £ m k E k '(G k )G,*Jh 

i 

+ h'lF"JJi -b E m k E*"G*G,*yi (39) 

i 

hrom the first order rank conditions there must be linear de 
pendence between the f x and G z columns of precisely the sort to 
make the second part of the right hand side of the above equation 
vanish for admissible h' s From the monotonic property of t ie 
functions Fand E it follows that [F'E/J are non vanishing, hence 
the transformed quadratic form retains its definiteness 
1J See Mathematical Appendix A Section V 
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This completes a sketchy proof of the invariance of our equi- 
librium conditions. This whole problem is of interest primarily in 
connection with utility and consumer’s choice theory, but our 
theorem applies just as well to the maximization of profit. If we 
maximize the square of profit, or the log of profit, or 90 per cent of 
profit, or that monotonic function of profit which the Treasury' 
tells us we can retain as net income, there will be no change in our 
optimum price and output policy. In connection with the produc- 
tion function the combining of the factors of production so as to 
produce a given output most cheaply could be done as well if we 
worked with some indicator of output rather than with output 
itself; alternatively, we might work with the iso-product contours 
alone, taken without regard to their numbering. However, it is 
important to realize that the variables, profit and output, are 
measurable under ideal conditions; consequently it is artificial and 
unnecessary to replace them by a transformed variable. 13 

Explicit evaluation of the new Lagrangian multipliers shows 
that they have been changed by the transformation as follows: 


m ): — 


Ek(G k ) ' 


(40) 


The significance of this for the marginal utility of income appears 
in the discussion in chapter vii. 


Transformation of Prices 

Thus far I have considered transformations of the unknown 
variables, leaving the parameters, a, unchanged. We are able to 
evaluate changes of the new variables with respect to a by equation 

13 Can those who have a fondness for cardinal utility find support for their position 
by analogy to other disciplines, such as physics? In my opinion, no. Certain physica 
concepts, such as temperature, were for a long time treated by the physicists as ordina 
magnitudes. Their behavior could be described by any one of many different indicators, 
whose relationship to each other was not linear. Finally it was deemed convenient to 
give conventional, privileged position to one such index, namely that based upon t e 
properties of a “perfect gas.” Nor is the use of the word “force” in physics books a goo 
precedent for utility analysis; much of physics can dispense with the term, and in any 
case the things for which it stands, such as rate of change of momentum, or eri\ ative 
of a potential function, can be given strict observational values, which are not invariant 
under monotonic transformation. Perhaps a better precedent for economics is t to 
statistics, where likelihood or its logarithm is used indifferently depending upon nu 
merical convenience, and without metaphysical connotations. 
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(22) However, in the new coordinate system we are no longer 
able to state definite theorems regarding the signs of these changes 
In the old system we could definitely state that 


dvi 

dwi 


< 0, 


(41) 


but our new coefficient (dvi/dwi) is a linear combination of terms 
of the form (dvjdwf), the sign of which can be anything 

And yet, since there was nothing privileged about the first 
coordinate system, it is manifest that there must be somewhere in 
the new coordinate system theorems no less definite than in the 
old As we shall see this “hunch” is correct There are such 
theorems, and they are to be derived by considering the changes 
of our new variables with respect to specified composite changes m 
our parameters That is at the same time that we subject our 
equilibrium unknowns to a transformation, we subject our param 
eters to a transformation The latter transformation is not the 
same as the former, but, as will be shown presently, the transforma 
tion of parameters is related in a definite manner to the trans 
formation of quantities 

For the present purpose, it is not necessary to examine the 
problem in full generality We may consider the important case 
of a firm maximizing profit as in the first section of this chapter 
The case of an extremum under constraint, such as appears in pro 
duction and utility theory, can be developed m a straightforward 
manner by the reader 

Let profits be written m the form 

■k{v,w) - R(v 1 , , v n ) — (42) 

1 

where the tu’s are parameters taken as given by the firm, and the 
t»’s are productive inputs The w’s and w s are just special cases of 
x s and p’s, having specific reference to inputs 

If now we define new commodities (vi, , 1 ), we ought to be 
able to find definite relations between their changes and changes 
in their prices, just as we have been able to do in the old coordinate 
system But what are we to mean by the prices of the new com 
modthes, (u> !p , w n ) Only one answer is possible as must be 

obvious to anyone who has worked with the price of a market 
basket of goods 



TRANSFORMATIONS, COMMODITIES, RATIONING 137 

The price of the latter is reckoned as a weighted sum of the 
individual prices, where the weights are chosen so that the total 
value of the new commodity, equal to wv, is the same as the value 
of the component parts. This suggests a general rule to be fol- 
lowed. Throughout all transformations, value magnitudes are to 
be conserved ; i.e., 

n n 

E w kVk = L WkVk (43) 

1 1 

regardless of the transformation. For this is a value magnitude 
(dollars, pounds, etc.), which is not arbitrary once prices and 
quantities in any coordinate system are given. On the other 
hand, prices and quantities involve the particular reference system 
being used, and are thus arbitrary. 

It is a familiar mathematical problem to subject two sets of 
variables to transformations, now assumed to be linear, so as to 
conserve their inner products as in equation (43). 14 Variables 
related in this way are called contragredient variables . 1S 

If we subject the quantities v to the non-singular linear trans- 
formation 

v = cv, v — c~ l v, (44) 

then 

w'v = w'cv. (45) 

If, and only if, 

w' = w'c~ l , or w — c'w, (46) 

will (43) be realized. Thus, the related transformations of prices 
and quantities are 

v — cv, v — c~ l v, (47) 

w — c~ v w, w = c'w, 

where the transposition and inversion of the matrices is to be noted, 
alopg with the fact that the relationship between the two trans- 
formations is reflexive. 

u This problem arises in higher geometry, in many branches of mathematics, and 
in multiple regression analysis where predicted dependent variables are to be indepen ent 
of transformations of the predicting variables. As the latter undergo a linear trans 
formation, the regression coefficients must be adjusted accordingly so as to eave t 
predictions unchanged. 

15 M. Bocher, Introduction to Higher Algebra, p. 108. 
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After transformation profits become 

*■(*'; w ) = *(»; ®) ~ £(c) — f; a, 5,. ^ 

In form this is the same as the original; consequently, it follows 
from the maximization conditions that everything that could be 
said about ( dv/dw ) can be said about (dv/da). Thus, 

dv , \ , 

- (*»)■* m 

is symmetrical and negative definite. 

This might also have been deduced from the fact that 

dv ^ ^dv_ dw 

da ~ dv dw dm ' ^ ' 

Substituting into this the equations of (47), we find 


dv_ 

da 


c 


dv 

dw 


tr v . 


(51) 


It is a classical mathematical fact that a matrix’s properties of 
negative definiteness and symmetry are preserved after a con- 
gruent transformation of the above type. 14 

There are many equilibrium systems encountered in economics 
which do not arise from an extremum problem and which cannot 
be converted into this form. The various simplified versions of 
the Keynesian system provide but a first example of what is after 
all the general case. A second example is that of the general equi- 
librium equations of Walras. An intermediate case is a third ex- 
ample in which we assume consistent preference behavior, infini- 
tesimal indifference loci at every point subject to certain generalized 
convexities, but without the assumption that “integrability" con- 
ditions are satisfied so that the “local" indifference planar elements 
can be “joined" together to form a family of indifference loci. 
Thus, our preference field may be characterized by a differential 

«• Mathematical Appendix A, p 368 The first proof above can be used to derive a 
proof for this classical theorem Also, if we set c = [a], it is easy to show that the 
inverse of a definite matrix, a, is itself definite. This is related to the "inverse’ profit 
function or “price potential" discussed in chapter tu, p 56 The latter in turn seeffls 
related to the so-called Legendre transformation See A Wintner, Analytical Founda- 
tions of Celestial Mechanics (Oxford 1941), chap i 
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expression, or Pfaffian, for which no integrating factor can be 
found. Only in the case of two goods can we always find an 
integrating factor which will lead to an “exact differential” which 
can be integrated. 17 

The first of the above three examples is of no interest in this 
connection since we cannot pair a set of "price” variables as 
“conjugate” variables to “quantities.” 

Turning to the second example of general equilibrium, I 
should like to point out a grave difficulty with the Hicksian concept 
of stability of general exchange and production. In later chapters 
it is argued that stability is an essentially dynamic concept, and 
that the formal statical treatment of Hicks leads to conditions 
which are neither necessary nor sufficient from a dynamical point 
of view. Here it will be argued that the Hicks stability conditions 
are unsatisfactory from a purely statical point of view. 

This may be illustrated by the somewhat simplified system of 
chapter v of Value and Capital, whose equilibrium is given by a 
set of equations of the form 

X k (pl, ■ ■ • , Pn) = 0. (fe = 1 >•••,») (52) 

To be perfectly stable the Jacobian, [dx/dp^, must have all its 
principal minors oscillate in sign. A matrix with this property I 
shall call Hicksian. To be imperfectly stable, we only impose the 
weaker condition that the ( n — 2) and (n — 1) order minors be of 
the opposite sign. 18 

Since no particular definition of commodities or frame of refer- 
ence is fundamental, if the Hicks conditions are fundamental, they 
must be invariant under transformations of price and quantity. 
A single example suffices to show that a matrix a which is Hicksian 
becomes under transformation c~ l \_dx/dp~\c~ v , and this need not be 
Hicksian, nor even imperfectly stable. 

17 The problem of integrability goes back to the “classical discussions ^of Irving 
Fisher and Pareto. In their “A Reconsideration of the Theory of Value, Parts I 
and II, Economica, XIV (1934), 52-76, 196-219, Hicks and Allen touch upon it, but 
not in an altogether satisfactory fashion. The most illuminating treatment is that o 
Georgescue-Roegen, “The Pure Theory of Consumer’s Behavior,” Quarterly Journal of 
Economics, L (1935-36), 545-593. See also the discussion of chap, v, p. 95 above, and 
P. Samuelson, “A Note on the Pure Theory of Consumer’s Behavior,” Economica, V 
(1938), 61—71, 353-354, for some unsolved problems in this connection. 

18 Value and Capital, pp. 67, 315. 
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Thus for 

dx _[ -1 10 ] 

dp~ L 0 -1J 

m; n 

we get 

dx _ f 96 9 6 ] 
dp = L 0 96 J 

which is neither perfectly stable nor imperfectly stable 

Of course if {* dx/dp~\ had been symmetrical and definite (as 
when income effects can be neglected) this could not have hap 
pened but then our system would have been convertible into a 
maximum problem 

I have shown elsewhere that convexity in non integrabte cases 
implies that the symmetrical part' of certain matrices be 
definite This follows in consequence of the relation 

53 Ax\A/\ s 0 (55) 

1 

where the p s and the x s may or may not be under constraints 
By going to the limit we find that [ dxfdp + dx/dp’"]/2 and 
[ dp/dx + dpldx’~\ll must be non positive 

It is not hard to show that this property, which we shall call 
quasi-defimteness is invariant under redefinition of commodities 
or under a general non singular congruent transformation Thus 
let [a + a'~]/ 2 be negative definite Then the quadratic form 

h’ak a h' h + 0 (56) 

is negative definite Conversely, if h'ah is negative definite, then 
a is negative quasi definite 

With the aid of a non singular matrix b w e subject a to a con 
gruent transformation so that 

a = b'ab (57) 

** Every matrix a can be decomposed into a symmetrical and an asymmetrical 
part Thus 



(53) 

(54) 
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Then 

h'ah = [h'b , ']a[bhf\ = h'ah , (58) 

and the extreme right-hand member is always negative, as must 
be the left-hand side. Thus, \_a -{- a'J/ 2 is negative definite, and 
a is negative quasi-definite. 

The above analysis may be of interest even where there is in- 
tegrability. If income effects cannot be neglected, the Jacobians 
of general exchange equilibrium need not be symmetrical. How- 
ever, if the income effects are symmetrical and definite, or if their 
symmetrical parts are definite, or if their symmetrical parts are not 
so lacking in definiteness as to outweigh the definiteness of the 
substitution effect, then the Jacobians will be quasi-definite. Not 
only is this property preserved under transformation, but it is 
possible to establish the Theorem: If a is quasi-definite, it is 
necessarily Hicksian, but not conversely. 

This important theorem can be proved in a variety of ways. 
One is to write each principal minor of a in a Taylor’s expansion 
around [a + a'~]/ 2 so that the skew elements (a,,- — a jt )/2 appear 
in a power series. Then it can be shown that all of these elements 
enter in even powers so that the complete a matrix is “more 
Hicksian” than its symmetrical part. This confirms my previous 
remark that asymmetry, per se, makes for greater rather than less 
stability. The real significance of quasi-definiteness can only be- 
come apparent from the dynamical discussions of stability in 
later chapters. 

Demand for a Group of Commodities 

Value and Capital will take its place in history along with the 
classic works of Cournot, Walras, Pareto, and Marshall. Like the 
latter, Hicks has succeeded in keeping formidable mathematical 
analysis below the surface of things and locked up in appendices, 
thereby securing for his work a much wider audience than would 
otherwise be possible. This tour de force was made possible in 
considerable degree by the repeated use of the already mentioned 
theorem relating to the demand for a group of commodities when 
their prices all change in the same proportion. We are now in a 
position to derive this theorem rigorously in a more general form. 



142 


FOUNDATIONS OF ECONOMIC ANAL) SIS 


When a group of prices say (pi p) all move together 
it is natural to define a new commodity a by the relation 

x* = P 1 X 1 -f pix s -f -f px (59) 


Strictly speaking we are not thereby enabled to replace n com 
moditics by (« — r + 1) commodities We are merely changing 
our frame of reference and so we must lose no dimensions or degrees 
of freedom in the process Our complete transformation of quan 
titles can be given by 


Here we have simply defined all commodities but one to be iden 
ticaliy the same as before M the first commodity has been replaced 
by the new composite commodity The contravariant price van 
able must then satisfy the equation 



so as to leave T. px = £ px Even though ( n — 1) commodities 
are the same as before their prices must have changed This 
will not seem strange if it is remembered that a price is not a 
property of a good itself independently of the frame of reference 
involved In fact it is quite possible for either prices or quantities 
to Income negative although £ px will conserve its original sign 
Now if we change prices (pi p ) in the same proportion X 
holding constant (p + i p a ) it can be shown by explicit differ 


t# Th s s only one of an nfin te number of poss ble transforms ons wh ch w It serve 


present purpose 
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entiation, dp/dh, that in the new price-quantity coordinates only 
the pi is changed with all other p’s remaining constant. Conse- 
quently, if this price changes, and if income is changed in such a 
way as to keep utility constant, then 


Also 


dpi ~ Ku < 0. 


K11K22 — Kv? > 0, etc., 


( 62 ) 

( 63 ) 


so that all of the properties of the demand functions given in the 
previous chapter are satisfied. 

Thus, the demand or a group whose relative prices are unchanged 
satisfies the same inequalities as in the case of a single good. 

This is the fundamental Hicksian theorem. 21 It can easily be 
broadened so that we may work with n) composite goods. 
Their substitution terms form a negative definite quadratic form. 

This is a very useful theorem. It states that any group of 
commodities whose relative prices remain unchanged can be lumped 
together into a single commodity, and that indifference curves in 
fewer dimensions can be constructed which will have all the usual 
properties of indifference curves. Thus, all goods but one can be 
lumped into one commodity which Hicks calls "money,” and then 
concave indifference curves can be drawn between the good in 
question and “money.” 22 

It would seem that this involves rather a strained use of the 
term "money,” and one almost certain to lead to confusion. 23 
Very properly when Hicks later comes to discuss monetary matters, 
he rejects this earlier notion. 

21 Proved in his Mathematical Appendix, pp. 311-312, as a consequence of his sixth 
rule relating to substitution terms. All six of these rules are contained in the statement 
h'[_Xif\h is non-positive definite quadratic form of rank (n — 1), which vanishes for 
values of h proportional to prices, this being an immediate consequence of the primary 
conditions of equilibrium and the secondary conditions guaranteeing an extremum 
under constraint. 

22 Value and Capital, p. 33, passim, diagram on p. 39. Also see A. G. Hart, Peculi- 
arities of Indifference Maps Involving Money,” Review of Economic Studies, VIII (1941), 
126-128. 

23 Such an instance is provided by a writer who interprets Hicks literally and con- 
fusedly attempts to throw light on the “inflationary gap” by means of an indifference 
diagram in which money is taken as a catchall for all but one commodity. M. W. Reder, 
“Welt are Economics and Rationing,” Quarterly Journal of Economics, vol. LVII (1942). 
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The General Problem of Composite or 
Aggregate Commodities 

Any economic system when viewed carefully will be found to 
consist of an almost uncountably large number of variables It is 
almost a necessity if advances in analysis are to be made to simplify 
matters artificially so as to reduce the number of variables which 
are to be handled This is done in a number of different ways 
Some authors retreat to a one or two commodity world m order to 
derive more precise results the penalty for this lies in the difficulty 
in establishing the relation between the simplified construct and 
complex reality This however is at least an honest procedure 
Other writers wish to have their cake and eat it too to work with 
only a few variables and at the same time retain an air of realism 
and versimihtude 

Here too a vanet> of artful dodges are open to the investigator 
(1) He may hold other things equal (2) He may concentrate on a 
representative firm or family, each of whom is doing about what 
all are doing (3) Or he may work with certain aggregate, com 
posite magnitudes such as bales of output, “socially necessary' 
labor wage units " cost of living, real national output, etc 

It is the third device which is of interest here There is 
nothing intrinsically reprehensible in working with such aggregate 
concepts On the contrary, abstraction from complexity is a 
necessary thought process And in any case the most general 
equilibrium set up must necessarily stop short of the full total of 
all possible economic variables But it is important to realize the 
limitations of these aggregates and to analyze the nature of their 
construction 

Broadly speaking we replace a number of variables by a single 
variable under two diametrically opposite conditions In the first, 
the variables hav e each the same effect (except possibly for scale 
differences which disappear upon redefinition) upon all of the eco 
nomte functions under discussion In this case they can be 
summed and treated as one Thus, we might define as the same 
commodities all goods which have exactly the same influence on 
consumption preference fields and production functions which are 
so to speak infinitely substitutable In practice rigid adherence to 
perfect substitutability might lead to the unmanageable result that 
> two things are quite the same Consequently, goods which do 
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not significantly (for the purpose at hand) differ are treated as 
identical. 

Actually the above case of substitutability is but one example 
of the more general mathematical theorem whereby the identical 
vanishing of a Jacobian, or of all of its minors of a given order, 
implies that there exists functional relationships between sets of 
the variables, so that many may be dropped out of the picture 
completely. It is clear from the implicit function theory that such 
simplifications are possible only in the case where the original 
system was indeterminate . 24 This may or may not be a matter of 
concern. What if the variables which are extraneous cannot be 
given determinate values by the system of equations which define 
equilibrium? The indeterminate variables may be a matter of 
indifference to the economist. Thus, take any determinate system 
of economic goods. Let each unit of one type of good be marked 
with an invisible serial number, and now let us ask how many odd 
numbered units will be bought by a given consumer. Clearly the 
answer is indeterminate, but also of no possible interest. If con- 
sumers did have preferences as between different serial numbers, 
then the indifference schedules would be affected by this fact, and 
additional equations would be available to determine the final allo- 
cation. And notice that even in the case of complete indifference 
the final allocation is not really indeterminate on any particular 
occasion. It is just that the determining factors, which are taken 
as due to “chance” by the economist, would be of a different 
character. 

Unlike the first condition under which variables are lumped 
together because they are infinitely substitutable, at the other ex- 
treme we combine variables which bear an invariant, “perfectly 
joint” relationship to each other. A notable example is provided 
by the classical “dose” of labor and capital applied to land. Here, 
too, the economist customarily relaxes the rigid requirement of 
perfect collinearity in favor of an approximate condition. Thus, 
the most primitive justification for a price level construct is pro- 
vided by the indisputable fact that prices do generally rise and 
fall in about the same proportions. 

21 This must be qualified. Our original equilibrium equations may be imbedded 
in a still wider set of equations so that the total is determinate, but the original subset 
taken by itself is not. 
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Geometrically this differs from the first case in which indiffer- 
ence cun es or iso quants are straight lines Here the contours are 
broken lines meeting at right angles in such a way that the goods 
or factors will by obvious choice be combined together in the given 
proportions almost regardless of price ratios Unlike the Hicksian 
bundle of goods the physical composition is the same regardless of 
relative prices 

Of all composite magnitudes perhaps the most interesting to 
the theorist is that of an index of the cost of living, or of an index 
of consumption output Such an index is designed to meet certain 
special requirements and need not be satisfactory for other pur 
poses In particular, it need not represent the desideratum from 
the standpoint of the questions which Jevons and other pioneers 
m the use of index numbers sought to answer Yet the theory of 
these indices is of some interest for its own sake and because in the 
course of investigation of these index numbers economists inad 
vertently stumbled upon certain ordinal relationships which arc 
basic to welfare economics and to consistent consumer’s behavior 

The Economic Theory of Index Numbers 

Economists such as Jevons, Edgeworth Marshall, Allyn Young, 
Warren Persons, Irving Fisher, Edwin Fnckey, and others have 
made contributions to what may be called the statistical theory of 
index numbers But what has come to be called the economic 
theory of index numbers ts concerned with quite distinct matters 
To this theory many economists have made contributions A 
partial list would include the names of Wicksell, Konus, Bortkie 
wicz, Bowley, Haberler, Pigou, Keynes, Staehle, Leontief, Allen, 
Lemer, Frisch, and Wald 55 

With the exception of Leontief all of these writers are unduly 
preoccupied with the problem of the price index number, instead of 
being occupied with the more fundamental question to which it is 
only a partial, and somewhat arbitrary, answer The fundamental 
problem upon which all the analysis rests ts that of determining merely 
from price and quantity data which of two situations is higher up on 

** Reference may be made to the survey article by R Frisch ‘Annual Survey of 
General Economic Theory The Problems of Index Numbers Economelnca IV (1936), 
1-38 also to the article by \V Leontief m the same issue The early volumes of the 
of Economic Studies may be consulted for other discussions 
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an individual’s preference scale. This problem admits of a partial 
answer if certain rigid assumptions are fulfilled. 

It is assumed that there is only one individual, whose tastes do 
not change in the period under consideration ; or if there are more 
than one, that their tastes are identical. It is not necessary to 
assume that the regime of commodities is the same in the two 
situations provided we follow the convention of setting at infinity 
the price of any good which is unavailable. At whatever the 
prices named, the individual is presumed to buy as many or as 
few of the goods as he wishes. This rules out such phenomena as 
rationing or monopsony. Ordinarily, a comparison is made be- 
tween two situations which differ in time, but we might just as well 
compare two situations different in any respect, as in the case of 
comparisons in the cost of living between two regions. 

Our fundamental data consist of prices and commodities, in- 
cluding productive services which can be treated as negative com- 
modities, in the two situations, represented respectively by (P a , X a ) 
and ( P b , X b ), where these are shorthand notations for the prices 
and quantities of n goods. Of course, if we know completely the 
individual’s whole preference field, we may simply insert into it 
the two amounts of commodities and read off which is the better, 
or whether they are indifferent. Since we do not know the field, 
our problem is to go as far as we can on what we do know. 

If for the moment we restrict ourselves to the quantity data, 
we ordinarily cannot say which of the two situations is the better. 
However, in the exceptional circumstance in which one of the 
situations contains more of some of the goods than the other, and 
not less of any of the other goods, then it is clear which is the better. 
Taking situation A as our reference point, and thinking of the 
simpler two-dimensional case, we can definitely divide up the whole 
of the commodity space into four regions with respect to X a by 
letting two perpendicular horizontal and vertical lines (not shown 
in the figure) intersect at this point. Treating this as the origin, 
we can definitely classify all points in the northeast quadrant as 
better than X a , and all points in the southwest quadrant as defi- 
nitely worse. We are left then with ignorance concerning the in- 
terior points of the other two quadrants. In these more of one 
commodity has been acquired at the expense of less of another, and 
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until we have further information concerning the preference field 
this is as far as we can go 

But it should be possible to utilize our price information If 
the individual is supposed to be maximizing his ordinal preference 
field subject to the given budget relation we may be sure that the 
point X tt is better (or not worse) than any point between the budget 

RE0ON OF IGNORANCE N ANALYSS 
OF NOEX NUMBERS 



locus and the axes For all such points were available to the in 
dividual in the initial situation and he nevertheless preferred to 
select X 0 It follows that for all such points 

U{X) S U(X-) (67) 

Thus we have narrowed down the region of our ignorance but not 
completely 

It will be noted that the two situations have not been treated 
symmetrically A always, beuug tiva xafeiawca It w.vgb.t well 

happen that X a lay inside the budget line of the point X b Assum 
ing for the moment that knowledge of the complete preference field 
were available for what values of X b would this be possible? Ap 
plying the requirement that X * must cost less than X b at the prices 
P b the boundary of this region will be given by the equality 

E P b X a = £ pb X b 


( 68 ) 
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Geometrically, this locus is the result of swinging the budget line 
through the point X a , and determining for each position the point 
where it reaches the highest indifference curve, to which it is 
clearly tangent. If we connect all such points, we have a familiar 
offer curve. All points above this offer curve, are definitely better 
than X a , in the same sense that all points under the budget line 
of X a are definitely inferior to X°. (It is to be emphasized that a 
knowledge of two points does not give us this locus in the way that 
it gives all the previous boundaries. Nevertheless, given two such 
points, it would always be possible to decide just where X b was 
with respect to X a .) 

We have now narrowed our ignorance down still further. Ac- 
tually this is as far as we can go on the basis of the given data. 
Note that the old boundaries of our ignorance derived from the 
utilization of quantity data alone have been made obsolete by the 
additional light furnished by the auxiliary price information. 

We have gone a long way, but there is still left a definite area 
of darkness — the space between the two heavy lines. I should 
like to state as strongly as possible that this final indeterminacy is 
intrinsic and inherent. No amount of ingenuity can remove it, 
grounded as it is in the fundamental convexity properties of the 
indifference field, or more accurately in the consistency behavior of 
the individual. It is important to prove this rigorously, for pe- 
culiarly in the literature of index numbers is an attempt made to 
search for limits within which the truth must lie without at the 
same time investigating whether or not these are the best possible 
limits. Moreover, the limits themselves are sometimes derived 
under special approximations, such as the neglect of “squares of 
small quantities,” etc. 

To see that these are indeed the best possible limits under the 
circumstances, let us suppose that someone proposes narrower 
limits. Since our preference field is arbitrary except possibly for 
certain curvature properties, we can draw the true indifference 
curve through X a so as to contradict any more definite result. If 
the proposer claims that a given point in the area of darkness is 
worse than X a , we can pass the indifference curve below that point 
but above the correct lower boundary in such a way as to make this 
statement wrong. Similarly, the opposite statement can be shown 
not to hold universally. (It is of course understood that “we do 
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not actually change the indifference field of the economic unit under 
observation, but "we" can find a consistent field for which the 
given result holds } Where the offer curves bend back we are able 
from quantity considerations alone to extend our boundaries along 
the dotted lines indicated in the figure 

For well behaved indifference fields with the proper continuous 
derivatives, the offer curve which gives the upper boundary of the 
area of darkness will be tangential to the indifference curve going 
through X tt and to the lower budget boundary through X a Even 
if there is a comer m the indifference curve at X a , there must be 
generalized tangency m the sense of touching from above without 
crossing One might be tempted to say that the area of mde 
terminacy narrows down to a point m the vicinity of X a , but 
nothing is gained thereby Nevertheless, the mathematician is 
tempted to consider Taylor’s expansions m the neighborhood of 
the X 9 point and to neglect terms of higher order Perhaps there 
is even some empirical statistical usefulness in the handling of 
budgetary data to be found in this practice, which has been associ 
ated with the names of Bowley and Wald 

But from a fundamental viewpoint there is no getting around 
the fact that for any finite move, however small, there remains a 
region of ignorance, this region may narrow down as the size of 
the movement decreases, but it never vanishes, except for the 
trivial case of a vanishing movement 21 Thus, there is nothing to 
be gatned for the present purpose from following the practice of 
Divisia 27 m working with differentials or infinitesimals The fact 
that these avoid difficulties connected with the time reversal and 
factoral reversal is not an indication of their superiority, but of the 
fact that they sidestep the intrinsic difficulties of the subject 
matter 

By the examination of value sums we can sometimes, but not 
always, state definitely whether one situation is worse than another 
we can never 'oy tnese means state tiiat two situations are 
equally desirable In fact, as will become evident from later dis 

u It may be worth mentioning that even a knowledge of the curvature of the in 
difference curve at X 4 will not enable us to narrow down our region of ignorance It 
will enable us to assert with confidence that the indifference curve will approach arbi 
tranly near to the osculating circle, but the departures for any finite small movement 
may be of either sign and of any magnitude 

17 F Divisia, Econointque Ralicmnelle (Paris 1928) 
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cussion, knowledge of prices and quantities on a finite number of 
points will not permit us to determine the equality of two points; 
but in the limit as the number of points becomes infinite we can in 
fortunate circumstances determine points of indifference. 

This result may be approached by a consideration of the addi- 
tional information which a third point will yield. We now have 
three pairs of points, and it may turn out that any two of them 
when put to the test of our computed value sums gives a definite 
answer. In that case each pair can be considered by itself without 
regard to the third point. Of course, from the discussion of chapter 
v it will be clear that a consistent ordinal preference field can never 
give contradictory testimony whereby point A is better than B, 
which is better than C, and at the same time the latter is better 
than A. For the relations of ordinal utility are transitive. 

But the relations of better or worse as revealed by the value 
sums are not transitive ; that is why I have elsewhere proposed a new 
notation to represent “revealed” preference in this special sense. 
Thus, if our value sums give a definite result so that 

£ P a X b £ P«X«, (69) 

this fact may be represented by the symbol 

X 4 © X°. (70) 

Since 

X 4 © X* implies U{X h ) < U(X°), (71) 

and this in turn implies 

U(X a ) < U(X b ), (72) 

then to avoid a contradiction in logic, we must be able to state 
the theorem 

X b © X a implies X° © X 4 . 

But this is not the same thing as the meaningless assertion that 
X 4 © X« implies X« © X 4 , (73) 

or the incorrect assertion that 

X 4 @X“ and X‘©X b implies X C @X°. (74) 
The most that can be stated under the above hypothesis is that 

X° © X e . (75) 



152 FOUNDATIONS OF ECONOMIC ANALYSIS 

This is much weaker than transitivity An additional indication 
that the algebra of revealed preference is quite distinct from that 
of cardinal or ordinal numbers is the fact that equality is not 
defined consequently, two points cannot be placed in one of the 
three categories A worse than B, or B worse than A, or the two 
equally good All we can say is either A is revealed to be worse 
than B or B is re\ealed to be worse than A or there is no indica 
tion one way or another These are mutually exclusive categories 
only if a consistent preference field is postulated 

It is precisely because of this lack of transitivity that know ledge 
of a third point may add knowledge to a comparison between two 
given points Our value sums may giv e no indication with respect 
to points X* and X * taken by themselv es but an intermediate point 
X b may serve to indicate their true ordinal relationship to each 
other We are in a position now to indicate just how much points 
X b and X e may narrow down our ignorance in relationship to X“ 
If the two additional points both lie in our old area of darkness 
we are no better off than before But if X b lies in the upper region 
of certainty with respect to X° and X‘ lies m the upper region of 
certainty with respect to X b , then even though X c lies m the X* s 
region of uncertainty it can still be definitely said to be better 
than X° Therefore we have narrowed down our area of darkness 
How much can this area be reduced in the most favorable case* 
From the geometry of the problem it can be shown that X b must 
lie on X“ s offer curve for the best results We then proceed to 
draw the offer curve through X b This will cross the old one and 
thereby narrow our region of ignorance since every point above 
the new offer curve is definitely better than X B If now we let A® 
take each and ev ery position on the old offer curve the process can 
be duplicated as many times as we wish, giv ing us a one parameter 
family of new offer curves The lower envelope to this family of 
curves gives us our new upper boundary of uncertainty 

In the same way, we can secure a best lower boundary by letting 
X b travel along the original budget line, generating through each 
such point a new budget line or a one parameter family of such 
lines in all The upper envelope of this family of lines is our new 
lower boundary Our new boundaries must of necessity lie within 
the old ones but there is still left an area of darkness 

If a fourth point is added we may narrow our ignorance still 
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further; similarly with a fifth point. It is intuitively clear that 
the boundaries will never meet for any finite number of points; but 
in the limit as the number of points becomes infinite, the upper and 
lower boundaries approach a common limit, which is of course the 
indifference curve through X a . Points along this curve, and only 
such points, can never find their place in a finite chain of points 
which serve to relate them unambiguously to the initial situation. 
Thus, with a single dimensional infinity of points any indifference 
curve may be traced out. With a two-dimensional infinity of 
points, the whole indifference field can be determined. In fact, 
with this much knowledge we may dispense completely with re- 
vealed preference, and instead "integrate” completely our elements 
of slope at each point into a one parameter family of indifference 
curves. 

The above discussion is concerned with the narrowing of our 
field of ignorance under the best circumstances. In actual practice 
we shall not necessarily do nearly so well. An instance is provided 
by the frequently met case in which we know a complete budget 
line (expenditure path, income consumption curve, etc.) giving the 
behavior of changes in all of the commodities with changes in 
money income but with unchanged prices. Such a situation is 
provided by observations of the behavior of more or less similar 
people, all confronted with the same prices but differing with 
respect to total expenditure. 

Not only is this of considerable statistical importance, but it is 
of special relevance in connection with the usual economic theory 
of index numbers, where a comparison is made between two price 
situations, rather than between two particular price-commodity 
situations. By applying our previous analysis it will be seen that 
the knowledge of an infinite number of points along two budget 
lines will not be sufficient to determine the whole preference field, 
or even a single indifference curve, or even be sufficient to enable 
us to match points of equivalent satisfaction on the two budget 
lines. If we select a given point on one line and through it draw 
a budget line and an offer curve, then these last two loci will split 
the other budget line into three parts. The lower part will consist 
of points all worse than the initial point, the upper part will consist 
of points all better than that point, while the intermediate part will 
constitute our area of indeterminacy. 



154 


FOUNDATIONS OF ECONOMIC ANALYSIS 


Of course the closer together are the two price situations the 
smaller w ill be the indeterminacy Therefore we can narrow down 
our ignorance by a knowledge of many intermediate budget lines 
In the limit as the number of budget lines becomes infinite in such 
a way as to decrease indefinitely the distance between each we 
shall approach the complete indifference field 

The value sums considered up until now have all had the dis 
advantage of being capable of yielding indeterminate results We 
shall put up with this if need be but first we must determine 
whether it is not possible to devise an index of quantity computable 
only from the prices and amounts of all the commodities which 
will be an unfailing indicator of ordinal utility Can w e not find a 
magic formula which will have these properties when applied to 
any preference field whatsoever or at least to those of the proper 
convexity? 

The answer is no As jet no one has devised such a formula 
and the following mathematical reasoning shows why no one ever 
can even if we permit more general functions than simple value 
sums Any such general formula will from the nature of the 
problem be a function of quantities and prices in one or more 
(usually two) situations Without loss of generality we may con 
sider the prices and quantities of all but one of the situations as 
fixed while the one situation represents an arbitrary variable point 
In terms of its prices and quantities (PA) we must be able to 
construct a single -valued function Q which is constant so long 
as the X points remain on the same indifference curve of the 
(unknown) preference field Because this is to be a quantity index 
it is clear that relativ e prices only can be important and so w e may 
express all prices in terms of the first good as numeraire Then Q 
may be written as 



For observable equilibrium quantities the ratios of prices are equal 
to marginal rates of substitution (*i? J l R n ) or to ratios of 
marginal utilities ( U t /Ui U n /Ui) Although the form of the 

relationship will change from preference field to preference field 
and is in any case unknown to us there will still nevertheless in 
^ specific instance be a functional relationship between these 
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ratios and quantities of goods. Therefore Q will be a uniquely 
determined function of the x’s alone. In fact, since Q is constant 
along an indifference curve, it must itself be one cardinal index of 
utility; its partial derivatives with respect to each commodity 
(computed by adding to the direct effect of each x on Q all indirect 
effects via influence on price changes) can be interpreted as marginal 
utilities, whose ratios are equal to observable price ratios, etc. 

In consequence of this last fact, equation (76) can be written 
in the form 


( 


Xi, 


X n 


dQ /8Q_ 

dxi / dxz ’ 


d_Q_ J dQ_\ 
dx 1 / dx n J 


-Q = 0 . 


(77) 


This identity is a first order partial differential equation which the 
preference field must satisfy, and only a limited small subset of all 
preference fields will do so. Consequently we have proved the 
impossibility of finding a magic formula to serve as a quantity indi- 
cator in the general case of a consistent ordinal preference field. 

In actual fact it is customary to restrict the form of a quantity 
index still further, to require that it be a homogeneous function of 
the first order in the component quantities, etc. This restricts the 
range of applicability still further, to preference fields possessing 
expenditure proportionality, etc. 

For any particular 6 it may be a problem of some mathematical 
difficulty to determine the exact restriction on the preference field. 
But where 6 represents the Laspeyre quantity index, thought of as 
a function of the quantity components of the second point, with 
base point fixed, it will be immediately seen that the only possible 
preference field for which it is exact is that of a family of parallel 
straight line indifference curves. Of course, this is almost a re- 
ducto ad absurdum, since convexity is denied, and since all relative 
prices (of goods which are bought) cannot vary. 

The “ideal index number” is one of the most popular index 
numbers. It is the geometric mean between Laspeyre’s and 
Paasche’s index numbers. A. Konus and S. Buscheguennce, 28 as 
well as S. Alexander (in an unpublished Harvard paper), have 
shown that it is exact only for certain hyperbolic indifference 
curves. This theorem may be derived by solving explicitly the 

55 See the reference in H. Schultz, “A Misunderstanding in Index-Number Theory,” 
Economelrica, VII (1939), 8 . 
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implied differential equation Similar results can be derived for 
other approximate” formulae which have lieen suggested 

As a result of the analysis of this section the conclusion may 
be ventured that the important economic content of index number 
theory resides in the fact that it attempts to utilize limited price 
and quantity data to infer ordinal preference comparisons The 
above formulation seems best designed to reveal this essential con 
tent, and to show the intrinsic limitations necessarily involved 
In the next section the same analysis is shown to be useful in 
connection with the more commonly met formulation of the index 
number problem 


Present Formulations of Index Numbers 
We are now in a position to apply our tools of analysis to the 
more familiar branch of index numbers First, we consider price 
of living comparisons between two different price situations The 
price of living index number in going from the situation (AT*) to 
( X b ) is familiarly defined as the ratio of the cost of the cheapest 
bundle of goods at the prices of the second situation which wall 
yield satisfaction equivalent to that of the initial situation to the 
cost of the initial bundle at the initial prices 

We assume as known (say from empirical studies on 4 identical 
individuals with varying incomes) the price expansion paths in two 
price situations ( P a ) and (P b ) These are defined by the followang 
sets of parametric equations 

x, = h,{p i\ ,£„•,/), (*=1, ») (78) 

x t ^h t {p ,p n \I) (i«l, ,«) (79) 

Consider an initial situation (X*) on the initial expansion path 
The indifference locus corresponding to this point will intersect 
the second expansion path in a point called (A'-' 1 ) such that 

U(X ab ) = U(X a ) (80) 

An index number of the price of living is defined as follows 


ZP a X a 


(81) 


I* = 


£ P b X b ’ 


Similarly, 


( 82 ) 
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where 

V(X*) = U(X b ), (83) 

and ( X ba ) lies on the “initial” expansion path. 

These respective index numbers are not rigidly related; in 
general they are not reciprocals. Since we do not know the hypo- 
thetical preference field, ( X ab ) is not known, and so our index 
number cannot be computed. 

However, from the considerations indicated above we know 
that all of the points on the expansion path ( b ) fall into three 


classes with respect to (X°) : 



1 . (X)©(X“) 

E P°X ^ E P a X a 

(84) 

2. (*“) © (X) 

E P b X a E P b X 

(85) 

3. (X) © (X") 

E P a X > E P a X a 

(86) 

(X°) © (X) 

E P b X a > E P b X. 



In particular, consider the equality signs in (84) and (85) and we 
get the boundary points defined by the intersection of 


E P h X = E P'X a 


(87) 

and 

Xi = hi(pi b , ■ ■ ■, pn, I). (i = 1, • • 

■, 7l) 

(88) 

Call this point ( u X ab ). 

Consider the intersection of 

E P a X = E P a X a 


(89) 

and 

= hi(pi b , ■ ■ pn, I)- (i - 1. • • 

■, n) 

(90) 

Call it ( l X ab ). By definition 

( l X ab ) © (X a ) © ^X ab ). 


(91) 

Hence, 

U{ l X ab ) < U(X a ) < U( u X ah ), 


(92) 

or 

U( l X ab ) < U(X ab ) < U( u X ab ). 


(93) 


Since along an expansion path the cost of a batch of goods and its 
“utility” are monotonically related, we have 

■£ P b l X nh < E P b X ab < E P h u X nh . 


(94) 
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Hence dividing by the appropriate factor (if it does not vanish) 
we hav e 


E P b l X ob ^ E P b X ah 

^ E P b "A 04 

(95) 

E P°X a ' E P a X a 

E P a X a ’ 

Y,P*‘X‘» < 

E P* X a 

E P b u X ab 

EP a X a 

(96) 


These are valid double limits for our index number computable 
under our specified assumptions More directly from the defmi 
tion of (“A 10 *) we have 

ZP bu X aH = L-PA- (97) 

Hence our double limits are 


E P b ‘A°* 
Z P°X a 

By symmetry we have 

E P° l X ba 
E P b X b 


< lot, < 


< I*” < 


EP" A“ 
EP a X a 


EP a A* 
E P b X b ’ 


(98) 


(99) 


where ('A fca ) is symmetrically defined with respect to ('AT 46 ) 
Again these four limits are not rigidly related no pair being neces 
sarily reciprocals Furthermore it follows from the analysis of 
the earlier sections that these are the best possible double limits 
under our hypothesis and will never converge to equality for 
convex indifference curves 

It is clear that the two upper limits arc determinable from price 
and quantity figures alone The points (‘AT 4 *) and (‘AT 64 ) however 
can be computed only if certain subsections of the respective ex 
pansion paths are empirically known Dropping this assumption 
we may ask the question as to what lower limits can be computed 
from price and. (quantity figures alone This being a more difficult 
problem the answer will in general be less satisfactory 

Recall that ^Af 06 ) was the intersection of the budget plane of 
(AT a ) with the expansion path of the second price situation Since 
we do not know the expansion path we cannot compute ('AT 06 ) 
But if we restrict ourselves to positive quantities it is always 
possible to find a point on the budget plane whose cost is less than 
ex' 14 ) and which must a fortiori be a lower limit Such a point is 
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that point on the budget plane whose cost at the prices ( P h ) 
furnishes a minimum with respect to all points on the budget plane. 
Let us designate this point ( x X ab ). It may be derived by 


minimizing 


F = E pH. 

1 

(100) 

subject to 


E pi a Xi — E pi°Xi a . 

1 1 

(101) 

This is a constrained minimum problem, and, since the equations 
are linear, the minimum is a restricted or boundary one, derived 
from the fact that no negative quantities are admissible. Ac- 
tually, it is easily verified that the minimum quantity ( x X ab ) is 
(0, 0, • • • , x x m ab , • • • , 0, 0) where x m is the good whose price ratio 
between the two situations is lowest, i.e., , 

PJ £ Pi” 

Pm“ ~ PP 

(102) 

( x X ah ) is easily computed since 

** - E P°x\ 

ym 

(103) 

Hence this lower limit developed by Mr. Lemer is as 

follows: 

T P b X X* pJ E P a X a p m b 

T P a X a " p m a E P a X a p m a 

(104) 

By symmetry, pi a /pi b is a lower limit for I ba , where 


Pi a *rPi a 

Pi b Pi b 

(105) 

In particular cases other lower limits are possible, 
that our actually observed situation ( X b ) is such that 

It is possible 

OT © pr»), 

(106) 

I ‘ e ‘’ z P a X b ^ E P a X a . 

(107) 


Recalling the definition of ( l X ab ), obviously 

T P a X b STP a 'X* 



i6o 

and 
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£ P b X b < £ P b l X* t 

and so in this case we have a lower limit as follows 


£ P°X° 


< I ab 


(109) 


Note that this holds only if (106) is realized 
given 

(*•> © OT. 

we have 


£ P°A' a 
£P 6 A> 


< 


On the other hand 

(HO) 

( 111 ) 


Except in the coincidental case where the equality sign holds these 
limits will be worse than those deducible from the w hole expansion 
path They may be better or worse than the Lerner limits given 
in (104) depending upon the case selected Note that it is lm 
possible for 

<*-) © (**> 

simultaneously with 

(*»)©<*•) 

Hence it is not possible to derive lower limits to both indexes 
simultaneously In fact given 

(*>)©(*») 

and 

<X») © (X b ), 

it is impossible to compute either lower limit in this manner 

If we widen our assumptions as to initial information still other 
limits are possible Knowledge of a third point may be utilized 
by the methods of the previous section , as may also knowledge of 
any intermediate expansion paths In fact m the limit as all inter 
mediate expansion paths are known i e as we know the functions 


Xi = h (pi , p n I), (* = 1 , , m ) 

the indifference map itself may be solved for implicitly ** 

It is possible to develop similar relationships between usual 
quantity indexes Let us define our quantity index from the oth 

** I do not d scuss the effect of imposing the assumption of monotonic expansion 
since there is no reason to rule out inferior goods 
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situation to the 6th as follows: 

O* = L pbXb _ £ P°X° P 

v £ P b X ab £ P*X a £ ~ pp* ’ (H2) 

i.e., this index is the ratio between the actual cost of the batch 
( X b ) to the cost of the cheapest bundle at (P b ) prices which would 
yield an equivalent satisfaction to that of (X a ). 

Similarly, 

£ P a X° p 

Q ba = ^ P a X ba ~ I b P a ’ (H3) 


Obviously, if one situation is preferred to the other, the index 
number between them will be less than one, i.e., 



U(X a ) < U(Xh) implies Q b « < 1, 

(114) 

and conversely. 

Likewise 



U{X a ) < U{X b ) implies Q ab > 1. 

(115) 

Hence, 

Q ba > 1 implies Q ab < 1, 

(116) 

and 

Q* > 1 implies Q ba < 1. 

(117) 

More generally, 

Q ab = 1 implies Q ba — 1, 

(118) 

where the inequalities are to be taken in the indicated order. 

However, 

Q ab ** Qba ’ 

(119) 


except under special circumstances. The above relations are 
consequences of the concavity of the indifference loci. 

Of course in the absence of knowledge of the preference field, 
it is impossible to compute these index numbers, since we do not 
know ( X ab ) and (X ba ). From the last section we know that 

£ P b x X ab s £ P b l X ab < £ P b X ab < £ P b u X ab . (120) 

£P b X b ^ £ P b X b nab £ P h X b 
£ P b x X 04 = £ p b l X ab ^ ^ £ P b u X ab ’ 


Hence, 


( 121 ) 
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or 


^ T.PW 
£ ^ " £ J* ‘A - 4 V E i*A- 


( 122 ) 


As before, ('A* 4 ) can be computed only if wc know the expansion 
paths From (103) we find 


E P 4 X* p m * E P h X' 
E P> X X* “ p„ 4 E P a X‘ 

Therefore, 


023) 


*>„• E P h A l 
p„ 4 E P‘A‘ 


E P 6 X* 
E 'A'* 4 


>(H> ggg 
v z pvF 


“Paasche” 


(124) 


Similarly, 

Pi' ZP'X* 

Pi-E^A* 


E P*A“ 
E /*• ‘A 4 - 




_ 1 

"Laspcy res” 


As before, if we have 
or 


(A 4 )© (A'-), 
(A*)© (A 4 ), 


still other limits arc possible 

Now it is only in the case where the “Paasche” index is greater 
than one or the “Laspcjrcs” index is less than one that it is possible 
to say which of the situations is better But, and this is what makes 
this whole branch of theory a stcnlc exercise, it is alwny s possible to 
determine this fact without any limits or without even introducing 
such index numbers Tor our question is always answered, when 
it can be answered by the previous analysis of "revealed" prefer 
ence, and the introduction of quantity and price indexes is indirect, 
unnecessary, and misleading It is indirect and unnecessary be 
cause it is a deduction from the previous simpler analysis It is 
misleading Because of the tendency to .attach siycuScanrc to the 
numerical value of the index computed 46 There is not a single 
valid general theorem in the present field of index numbers which 
u n^t deducible from the analysis of the previous section 

Of course, in the case of expenditure proportionality certain new 
invariances do emerge (1) It is always possible to derive double 

10 See the penetrating remarks of W Leontief Composite Com modi t es and the 
^oblem of Index Numbers Economelrua vol IV (1936) 



TRANSFORMATIONS, COMMODITIES, RATIONING 163 
limits; (2) The respective index numbers are reciprocals, i.e., 

Q ba Q ab = 1 = (125) 

One last serious drawback to the present formulations of index 
number theory rests in the fact that these are expressed as ratios. 
If we admit productive factors as negative commodities, and for 
many purposes this greatly enlarges the generality of our analysis, 
denominators may vanish or change sign. Since all we wish is an 
algebraic comparison between value sums, it is unnecessary and 
undesirable to work with ratios. Instead the methods of the 
previous section are preferable. 

Pure Theory of Choice under Rationing 

Thus far in this chapter we have discussed the effects of general 
transformations of our dependent and independent variables. 
This led naturally into a study of composite commodities and of 
index numbers. At this point there is also suggested the interesting 
and important related problem of placing constraints upon the 
consumer in addition to that imposed by fixed total income. Of 
course, rationing involves such constraints. 

The simplest kind of rationing is that in which the government 
specifies the maximum amount of a particular commodity that each 
individual can consume. Sugar provides a common example, being 
relatively homogeneous and universally in demand. Ordinarily in 
rationing the individual is subjected to an inequality. He must 
not consume more than a given amount, but he need not consume 
as much as that amount. Of course, if the rationing is to be of 
any use, it will be applied to situations in which for many indi- 
viduals the allotted amount will actually be bought. Otherwise 
the law has no teeth, and individual behavior is left unchanged. 

The single individual maximizes ordinal utility as before except 
that he is now subjected to additional constraints of the form 

x i = bi, Xj SI bj, * * •» (126) 

where the commodities i, j, • • * are given maximum individual 
quotas of respective amounts per unit time, bj, • • • . To present 
the conditions of equilibrium in these circumstances we must dis- 
tinguish carefully between various possible cases. The simplest is 
that in which the rationing quotas are so small that each is effective. 
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in the sense of expenditure on future goods? Upon reflection I 
think the reader will conclude that this is primarily a verbal prob- 
lem, much aggravated by the customary loose and ambiguous usage 
of the concepts of “money” and numeraire. 

Various conventions are open to us, any of which is satisfactory 
provided that it is adhered to consistently. We may include 
among our commodities goods of different periods of time, and we 
may assume certain expectations with respect to future prices. 
For the present purpose it is simpler to assume that total expendi- 
ture upon present commodities can differ from present income by 
the algebraic amount of saving without going into the problem of 
the form (cash, securities, etc.) that these savings take. If saving 
is determined by the usual preference calculus, certain implicit as- 
sumptions are made with respect to future incomes, prices, ordinal 
rates of time preference, etc. 

If it wishes to, a powerful government can arbitrarily limit the 
consumption of all present goods, and at the same time permit only 
an arbitrarily small amount of the surplus of income to be carried 
over for future uses. Ordinarily, in connection with a rationing 
program the State does not so limit saving, relying instead upon 
a personal tax program to bring about this desired result. Further 
discussion of this point may therefore be deferred. 

Single commodity rationing has certain shortcomings which 
give rise to a demand for some form of “point-rationing.” Instead 
of limiting the amount of a single commodity, the individual is 
limited to a weighted sum of a number of commodities, the point- 
prices providing the relative weights. But such a weighted sum 
represents nothing more than a single commodity in a new trans- 
formed set of variables. It is for this reason that the general 
theory of rationing belongs in the present chapter on transforma- 
tions and composite commodities. 

It would lead us into the field of welfare economics if we were 
to discuss the criteria used in classifying commodities into groups 
and the determination of their point values. Suffice it to say that 
administrative considerations, criteria of substitutability on the 
consumption and production sides, all contribute to the decision 
as to which commodities shall fall in the same group, what the 
relative values of each shall be, how many groups there shall be, etc. 
Moreover, although this is not strictly necessary, each comm v 
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is usually restricted to but one group so as to obvntc the necessity 
of handling many kinds of tokens or stamps m making a given 
purchase 

In addition then to the dollar prices (pi, , p„), and money 
income, I, each consumer is confronted with r classes of point 
prices (p/, , />„'), (Pi", , />*"), , {pi, , p%), and fixed 

totals of expendable points per unit time (/', I ', , I’) Each 

of the sets of point prices will have zeros for most commodities and 
for a given commodity all but one point price will usually be zero 

This being the case, the consumer will maximize U{x\, , x % ) 

subject to the generalized budgetary constraints 

£ p } x, s I, £ P/xj^r, , £ p/x, 2 I' (128) 

1 i i 

For the moment the matrix CP/3 will be assumed to be of rank 
(r+1) There ml! result optimal amounts of each good for each 
specified complete set of point prices and expenditure allotment 
We may summarize this result by writing down the generalized 
demand schedule for each good, written as a function of all prices 
and points and all total incomes or expenditures Thus 

p. pi .pi’, .p.'.i.r, J') 

(, = 1 , n) (129) 

It is the purpose of the regulating authorities so to determine point 
prices and allotments as to result in equitable" inter individual 
amounts of consumption, appropriate total output at appropriate 
dollar prices In addition to such policy matters, the economist is 
interested m the purely positive problem of determining the prop 
erties of these demand functions implied by the ordinal maxitmza 
tion process 

For this purpose we must examine the maximum conditions of 
equilibrium By the usual Lagrangean multiplier technique the 
first order conditions are easily seen to be 

u % -v- \p t + yp: + -f* x t p, t = o, — i , , n) ( 130 ) 

provided that all of the group allotments arc ‘ effective’ as indi 
cated by the presence of the equality signs in (128) If any pit 
ticular group allotment is ineffective for the individual in question 
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id; 


its Lagrangean multiplier may be set equal to zero in the above 
equations. 

In words: the consumer will purchase each good up to the point 
where its marginal utility is equal to a weighted average of its 
various dollar and point prices, the weights being the marginal 
utilities of the last dollar or group ration coupons. 51 

For a regular maximum our necessary and sufficient secondary 
conditions are contained in the statement that the Hessian of the 
utility function must represent the matrix of a quadratic form 
which is negative definite under the (r -f- 1) linear constraints. 
This is equivalent to certain conditions on the matrix formed by 
bordering the Hessian with the matrix of the constraints, the 
transpose of the latter, and zeros. If we strike out rows and 
columns corresponding to each of {n — r -p 1) commodities in 
turn, the resulting {n — r -}- 1) principal minors must oscillate in 
sign, the smallest one being negative, the next positive, etc. 
Thus let 


Then 



(hi = 1, 

(k,s = 0, •••,?-) (131) 

[vi = (n — r -f l)i • * •» «] 


(- l) CT A m > 0. 


(132) 


As throughout this book, the fundamental observationally 
meaningful restrictions on observable prices and quantities emerge 
as consequences of these secondary extremum conditions. In this 
case such implications can most simply be stated after the concept 
of a compensated price change has been suitably generalized. At 
least since the time of Slutsky it has been customary to deal with 
a compound change in price and income, where the latter is \ aried 
along with the former so as to leave the individual on the same 
indifference curve. 

When rationing enters the picture so that we have auxiliary 
constraints, the problem is made more complex. An increase in a 

31 This seems to have been pointed out in print for the first time by T. Scitmskj , 
"The Political Economy of Consumers' Rationing," Review of Economic Statistics, 

(1942), 1 14-124. Other theoretical aspects of rationing are discussed m H. P. Netsser, 
"Theoretical Aspects of Rationing," Quarterly Journal of Economics, L\ II (194j), 
378-397. Also see N. Kalecki, “General Rationing,” Oxford Bulletin of Statistics, 

III (1941). 
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given price, or in a given point price, will make the individual 
worse off. But he can be compensated for this in many different 
ways— by a change in money income, by a change in his point 
allotments of the same or of a different group, or by any combina- 
tion of these. It is most useful for the present purpose to consider 
that special compensated change in the dollar or point price of a 
commodity in \%hich the individual is kept just as well off as before 
by means of a simultaneous change in the same total expenditure 
allotment (dollar or particular group point allotment). Let the 
change in the ith good resulting from a change in the £th group 
point price of the ,/th good when compensated as above be written 
in the form (dx,/0/>/) wmp Then regarded as an n by n matrix, 
with k a fixed number, this must be negative semi-definite of rank 
(n — r + 1). Thus, the compensated change in a good with re- 
spect to one of its own point prices must always be negative; in 
addition, we have inequalities on certain cross product and inter- 
action terms These are expressed completely by the statement 
that the smallest naturally ordered principal minor of the above 
matrix must be negative, the next positive, etc., and finally the 
(w — r -f 2)th and all higher principal minors must vanish. The 
last part of this statement is equivalent to the obvious proposition 
that the demand functions are homogeneous of order zero in each 
set of point prices and expenditure 

If one wishes, off diagonal terms may be interpreted as general- 
ized complementarity coefficients, and it may be left as an exercise 
to the reader to show that these obey the usual symmetry rules. 
In fact, the reader may easily show that the matrix of compensa- 
tion terms is nothing more than the northwest partition of the 
inverse of the matrix, after this submatrix has been premultipHed 
by the scalar (— X*) 31 

The interested reader may also develop for himself the general- 
ized Le Chatelier-Braun principle introduced in chapter iii. I n 
words, its economic significance may be summarized as follows: if 
in a given position of equilibrium a (compensated) change in price 
is made, the resulting change in amount demanded of that good 
will be greater if the individual is not subjected to the extra con- 
straints of rationing than if he is subjected to such constraints? 

t 11 See chap v, p 103 Also see p 378. 
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furthermore, the introduction of each new constraint will make 
demand still more inelastic. 

The tendency for demand to become inelastic under rationing 
has a number of important implications for policy. In particular, 
it helps to explain why in a system which has already been subjected 
to numerous direct controls, fiscal measures aimed at sopping up 
superfluous income have effects which are of only secondary impor- 
tance. To explore this further would involve us in considerations 
of effective demand outside the scope of the present discussion. 

Most economists will not be interested in the somewhat arti- 
ficial compensated change in point-price, but rather in the ordinary 
ceteris paribus change, in which other point-prices and expenditure 
are kept constant. It is easy to show by explicit differentiation 
of the equilibrium conditions, as well as from general considerations, 
that this may be split up into two parts: the compensated change 
and the income effect. Thus, 

(dX{/dpj k ) cet. par. = (dXi/dp/ : )comp. ~ X,{dXi/dI k ). (133) 

Except for (“generalized”) inferior consumption goods, the second 
term will be positive; consequently, a change in a good resulting 
from an increase in its point price must be negative'in all normal 
cases. But if we have generalized inferiority, we may (but need 
not) have a generalized Giffen’s paradox, in which raising the 
point-price of a commodity causes more of it to be bought. 

Our discussion of rationing may be brought to a close with a 
few comments on the special mathematical features which dis- 
tinguish it from the general analytic case of any linear constraints. 
First, it is almost universally the custom to make all point prices 
positive, even though one could in theory imagine a case in which 
there were negative prices. (For example, ration tickets might be 
given to a consumer who will take certain redundant staples, etc.) 

Second, the authorities often set ration point-prices proportional 
to dollar prices. A special case of this is provided by dollar ex- 
penditure control on various commodities or groups of commodi- 
ties. An even more special case is that of over-all expenditure 
control, in which the individual is given a quota of dollar expendi- 
ture, ordinarily less in amount than his disposable income. If 
savings are not treated as one of the n commodities, this is equiva- 
lent to introducing linear constraints whose rank is less than 
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(r + 1); consequently the equations of (128) are inconsistent if 
equality signs are used We must then have recourse to inequali- 
ties, of course, in this case the dollar budget equation must be 
relaxed in favor of the new quota allotment. 33 In rare cases where 
ration tickets or quotas arc over-generous, the auxiliary constraint 
will become an inequality and the rationing will be ineffective 

A different mathematical problem arises if the consumer can 
pay for a commodity by giving up a certain number of ration 
stamps of one kind or by giving up some of another. Depending 
upon the relative prices and scarcities of the different stamps, he 
will ordinarily choose to spend one rather than the other in a 
fashion reminiscent of Gresham’s Law. If he has similar options 
with respect to many commodities but at different relative ratios, 
he will spend a given kind of stamp on those commodities in which 
it is relatively advantageous in a manner formally identical with 
the classical theory of comparative advantage as applied to the 
determination of which of many commodities will be exported and 
which will be imported. 

The above case merges into that in which the different kinds 
of stamps can be converted into each other at rates determined on 
a black or white market, or by the government. Unless the 
government were explicitly to ban such transactions, there would 
inevitably arise trade in the different ration groups and money 
Arbitrage would create ruling market ratios of exchange which any 
individual as a small consumer could not appreciably affect. Once 
the individual is confronted with given rates at which he can buy 
or sell each type of ration points, the problem is no longer one of 

“ Since in practice a commodity is not given more than one ration point price, 
“degeneracy" cannot arise in any other way. If the authorities should require multiple 
point prices for a given commodity, unless care is taken the consumer may be put into 
the position of not being able to spend all his points This difficulty may arise in two 
different ways through true degeneracy and inconsistency in the auxiliary constraints, 
or from the fact that the admissible solutions of the linear equations do not yield positive 
amounts of all commodities When this is the case, the first order conditions of equi- 
librium given in (130) are modified, but not as before by setting certain of the Lagrangean 
multipliers equal to 2 ero, but rather by replacing certain equality signs by greater than 
signs Any commodity which is not bought will have a marginal utility less than the 
above specified weighted average of point prices The problems raised in this footnote 
are similar to those discussed by Schlesinger, Wald, v Neuman, Neisser, and v Stack el berg 
m connection with the consistency and independence of the equations of the Walrasian 
iry of production in its simplest constant coefficient form 
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many auxiliary constraints. Thus, if orange stamps can be bought 
or sold for five cents apiece, and if a commodity costs twenty-five 
cents and two orange stamps, it can simply be regarded as a com- 
modity which costs thirty-five cents. The consumer simply reverts 
back to the standard type of budget equation, modified only by 
the fact that his money income may be increased or decreased by 
the need to buy or sell ration points; or, what is simply another 
way of saying the same thing, the individual has higher income and 
is confronted by a new higher set of prices. If the dollar values 
of the first, second, • • •, and rth ration points are ( V , b ", 
the new budget equation becomes 

£ (Pi + £ b k p, k )xi = / + £ b k I k . (134) 

1 x 1 

Actually, from a welfare point of view it can be shown that the 
free interchange of different kinds of stamps, among themselves 
and against money, is in a certain sense optimal. In fact, it is an 
indirect way of permitting individuals to exchange goods so long as 
the exchange is mutually advantageous. These last two sentences 
must be qualified. They must not be taken to mean that every 
individual will be better off if coupons or goods can be interchanged. 
Thus, if the rich buy redundant ration coupons from the poor to 
the mutual advantage of both, their expenditure by the rich will 
cause an increase in the point price of the scarce rationed goods in 
question to the disadvantage of the middle classes. 

Nevertheless, it can be shown that free interchange is optimal 
in the sense that its introduction accompanied by appropriate 
modifications in the point allocations to every individual could 
lead to an improvement for everybody. In the above examp e, 
the middle classes could be bribed into acquiescence, and there 
would still be a margin of advantage left for rich and poor. I do 
not imply by this that the middle classes should receive such a bribe 
since that would suggest a belief in the perfection of the previous 
status quo. Nor should it be thought that anything said here is an 
argument for making coupons interchangeable, since there might 
very well be in actual fact very grave difficulties in the way of 
devising a method of point allocation which would recognize the 
harm done to particular individuals. 



CHAPTER VII 


SOME SPECIAL ASPECTS OF THE THEORY OF 
CONSUMERS BEHAVIOR 

Chapter v exhausts the content of utility analysis in its most 
general form involving only an ordinal preference field There 
remains in the literature a great number of discussions of particular 
problems which involve special and extra assumptions In order 
to present a fairly complete account of the present status of the 
theory I propose to examine some of these carefullj to show their 
empirical meaning This involves a break in the unity of exposi 
tion since each special assumption has often been made mdepend 
ently of all others There is no choice but to go through the list 
with no regard for continuity Among the topics to be discussed 
will be the cardinal measure of utility independence of utilities 
and measures of complementarity and constancy of the marginal 
utility of money 

It is clear that every assumption either places restrictions upon 
our empirical data or is meaningless A price must be paid for 
any simplifications introduced into our basic hypotheses This 
price is the limiting of the field of applicability and relevance of 
the theory because of the extra empirical restrictions to be im 
posed on the data Many writers do not appear to be aware of 
this, m any case few have indicated the costliness of their assump 
tions or have adduced any evidence to support a presumption of 
their admissibility 

There is a further serious difficulty Despite the fact that 
developments in this field are not recent and that mathematical 
methods of exposition have been employed ambiguity still per 
meates the contentions of many writers This ambiguity can go 
unnoticed precisely because there has been so little interest m the 
operational significance of these assumptions To put the matter 
somewhat harshly ambiguously defined assumptions are used to 
give a semblance of deriving theorems which are themselves 

onclusivc 
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A recurring source of difficulty in this connection which goes 
back as far as Marshall is the practice of introducing certain mathe- 
matical relations as alleged “approximations.” These are pre- 
sented as being valid in the neighborhood of a point of equilibrium. 
Even if such a relationship be admissible (as in the case of Taylor’s 
expansion), precisely because it can be applied to any and all 
properly continuous functions it is devoid of meaningful sig- 
nificance. Moreover, in common usage the restrictions which are 
introduced as allegedly holding only for a restricted neighborhood 
of equilibrium are in fact used to deduce results which follow only 
from an entirely different interpretation of these assumptions. 
We shall have occasion to enter more deeply into these matters at 
later points in the discussion. 

The Cardinal Measure of Utility 

We have seen from chapter v that all empirical market behavior 
is independent of the choice of a particular index of utility, or 
indeed of the choice of any measure of utility at all . 1 Nevertheless, 
many writers have wished to introduce the concept of a cardinal 
measure of utility, unique except for constants of scale and origin. 

Lange , 2 Fisher , 3 and others have contended that measurable 
utility, while superfluous from the standpoint of positivistic be- 
havioristic description, is necessary for the purpose of a normative 
science of welfare economics. 

While I cannot consider this problem in detail, it is ivell to point 
out that this is not at all necessary. Assuming that Welfare Eco- 
nomics involves comparisons between individuals, it is sufficient 
that explicit welfare judgments be made such that we are able to 
relate ordinally all possible combinations of goods and services 
consumed by each and every individual. Nothing at all is gained 
by the selection of individual cardinal measures of utility . 4 

1 Some writers have partially recognized this fact, but still maintain that the avoid- 
ance of the use of utility is a “stunt,” an axiomatic experiment by means of which we 
make our way more difficult. From the present operational orientation this view is 
clearly superficial. 

2 Oscar Lange, "The Determinateness of the Utility Function,” Review of Economic 
Studies, I (1934), 218-225. 

3 Irving Fisher, “A Statistical Method for Measuring ‘Marginal Utility and Testing 
the Justice of a Progressive Income Tax,” in Economic Essays in Honor of John Bates 
Clark (New York, 1927). 

4 Of course, if one considers general welfare as the algebraic sum of individual cardinal 
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In any case it is clear that there are an infinity of ways of select- 
ing a particular utility index and defining it as the true cardinal 
measure of utility. Thus, one might draw a straight line from 
the origin through a point representing an arbitrary bundle of 
goods and services. The numerical distance of any point on this 
line from the origin could serve as an index of utility, and might 
be dignified with a unique title 


The Assumption of Independent Utilities 

A method of selection such as that outlined above is obviously 
arbitrary. Others are more subtle, but equally arbitrary. Pro- 
fessors Fisher 6 and Frisch,® employing only price and quantity 
data, have measured what is alleged to be marginal and total 
utilities and how these magnitudes vary with changes in income, 
quantities, etc. Although the exact technique differs between 
these two writers, the fundamental principle remains the same. 
This can be illustrated with reference to a simplified case where 
only two consumption goods are involved. 

Given a great number of observations on prices, quantities, and 
total income, one could in the limit more or less trace out the whole 
indifference map We should still, however, have- said nothing 
about the numbering of the one-parameter family of indifference 
loci so traced out Both Professors Frisch and Fisher employ the 
following definition for selecting out a particular utility index to 
be designated the “true” measure of utility, subject to origin and 
scale constants That utility index, if it exists, is to be selected 
which can be written in the form 


i e , for which 


V = /(*) + g(y) ; 


o'xv'y 


0 . 


(1) 

( 2 ) 


utilities, then one will require the cardinal measurability of u tility But such an assump- 
tion is arbitrary and gratuitous Cf A. Burk (Bergson), “A Reformulation of Certain 
Aspects of Welfare Economics,” Quarterly Journal of Economics, vol LI I, no 2, Febru- 
ary, 1938 pp. 310-334 
* Fisher, op cit 

*R Frisch, ‘‘New Methods of Measuring Marginal Utility,” Beitrage sur okono- 
mischen Theorte, no 3 (Tubingen, 1932), H Schultz, The Theory and Measurement of 
demand, pp 111-117 
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If there exists one index of utility which can be written in the 
form of equation (1), then any other index which obeys the same 
law must differ from it only by a linear transformation. For, 
consider another index 


for which 


F = F(<p), 
d 2 F 


dxdy 


= 0 . 


(3) 


(4) 


Successively differentiating (3) partially with respect to x and y, 
we get 



dxdy F + F ‘Pz'Pu- 

(5) 

This, together with (2), requires that 



F"<Px<p v = 0, 

(6) 

or 


F"(*0 a 0. 

(7) 

Therefore, 


F — d bip, 

(S) 

where a and b 

are origin and scale constants respectively. 



It is clear, therefore, that the assumption that utilities shall be 
“independent” will help to select one utility index as the cardinal 
measure of utility. Nevertheless, even this convention is not in 
general applicable. It will guarantee us that we do not have two 
different utility scales, as has been shown in the above proof; it 
will not, in general, provide us with even one scale. 

If we assume an indifference field obeying the ordinary con- 
cavity restrictions and nothing more, then there will not, in general, 
be even one utility index which can be written in the form 

<p = /(x) + g{y), 

<Pxy = 0 . 

Let us write out one legitimate utility index. 

II = H{x, y). 

Does there exist a transformation F such that 


(9) 


( 10 ) 


<p = F[_H{x, y)] = f(x) + g(y)? 


(ID 
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The answer m general must be in the ncgativ c T urther arbitrary 

restrictions must first be placed upon the indifference field 

Let the indifference field be defined in the following form, mde 
pendent of any utility concept 

-(!’)„ =«*•»>• < 12 > 

\ <** /// "coiulint 

where R is a function of x and y obeying the following curvature 
requirements 

R t - R V R< 0 (13) 

The necessary and sufficient condition that there exist a utility 
index which can be written m the form 


is as follows 


V> “ /(*) + g(y), 

V>„ = 0 

RRjy ~ RjRy = 0 


9* log R 
dxdy 

1 he necessity is \cnhcd by the different! ition of 


R(x.>) 


fix) 

g'(>) 


Hie sufficiency is also easdj indicated 
If 

0* log R 


dxdy 


= 0 


log R = log h(x) - log k(y) - log 


H}) 

*(>)' 


where h and k are arbitrary functions Form the expression 
Rdx + dy = '^jdx + 

I his can easily be transformed into tile ex tet differential 
dip = h{x)dx -f k(y)dy. 


(14) 

(15) 

(16) 

(17) 

(18) 


(19) 
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or 


<P = f h(x)dx + f k(y)dy + constant 
•Ja *Jb 

= fix) + g(y). 


( 20 ) 


We must now investigate the meaning of the restriction in (14). 
The assumption of independence of utilities in order to define a 
cardinal measure of utility is seen to involve (1) a convention by 
means of which one out of an infinity of possible utility scales is 
designated as the true cardinal measure of utility; (2) an arbitrary 
a priori restriction upon the preference field, and hence upon em- 
pirical price-quantity behavior. The meaning of this restriction 
we must now investigate. 

The functional restriction (14) is a partial differential equation 
of the second order of the general form 


M(R,R x > Ryy Rxx i Ryyi Rxyj x, y) = 0. (21) 

Subject to boundary conditions involving two arbitrary functions, 
it will serve to define a unique solution function 

R = R(x, y). (22) 

More specifically, if we are given as empirical observational data 
the two expenditure paths corresponding to the changes in quan- 
tities with income in each of two respective price situations, then 
from these observations, and these alone, the whole field of in- 
difference curves can be determined by suitable extrapolation. 

It is not easy to visualize intuitively why this should be so; 
indeed, few economists would be so bold as to claim that the be- 
havior of an individual in all conceivable circumstances should be 
derivable from so few observations. And yet this is the conclusion 
to which we are forced by the apparently innocuous assumption of 
independence of utilities. 

Moreover, (14) places definite restrictions on our demand func- 
tions, the validity of which are equally dubious and equally im- 
possible to comprehend intuitively. For the simple two-commodity 
case our conditions of demand equilibrium can be written 


4r = R(x, y), 

r V 

I = PrX + P v y. 


(23) 
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These may be transformed into 



These demand equations must be subject to the restriction 
dxdy ~ ^ 


( 24 ) 


(25) 


When there are more than two goods the restrictions implied 
by the very possibility of an independent index, of utility take on 
a different and more complicated form If there exists an index F 
for which 

F„ a 0, (t 9 * 3 ) (26) 

then 

F tt = F\tp)<p , + F"(v)(p x <p } = 0, (27) 


where <p is any other index Thus 


H-n*) ( 28 ) 

where T is an arbitrary function, and v 5 's any index of utility 
Taking into account the ( n — 1 )(n — 2)/2 conditions of Integra 
bihty this implies an additional n(n — l)/2 conditions It is to 
be noted that these are identities holding everywhere Not only 
are they necessary, but the transformation 

F = £eS.-Wdv (29) 

shows them to be sufficient as well 

In terms of the indifference varieties, these take still another 
form Let 

% m - fir “ *■*'(*. . *») ( 30 > 

These, of course, satisfy the identities 
*Ri 

Tgi = k R J (hj,k *= 1, ,n) 


(31) 
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If an independent index of utility is possible, then we must have 
d { R j 

= °- i) (32) 

In view of the n 3 relations of (31), the above n{n - l) 2 relations 
are not all independent. At most n(n - 2) are independent, and 
these may be written in the form 

(i ^ j), (hj = 2, • • •, n) 

(33) 

(i = 3, • - •, n) 

These conditions are both necessary and sufficient. They im- 
ply among other things the ( n — 1 )(n — 2)/2 integrability condi- 
tions of equation (14) of chapter v. On the other hand, if the 
latter are postulated at the beginning, then equations (33) cease 
to be all independent and can be reduced in number. 

We have then n(n — 2) partial differential equations of the first 
order. Subject to an equal number of arbitrary functions, the 
general solution is uniquely determined. But empirically an ob- 
servation of an expenditure path involves ( n — 1) functions. 
Hence, observation of more than n{n — 2)/ (n — 1), or more than 
n expenditure paths, could be used to disprove the possibility of 
independence. 

In words, the implications of independence are that the amount 
bought of any good x { can be expressed in terms of its price p x , 
the price of any other good pj, and the amount of expenditure upon 
these two goods alone, i.e., upon the amount left over out of total 
expenditure after all other goods are bought. Thus, the general 
demand function 

Xi = h%pl, ■■•,pn,I) 

can be written in the special forms 

h'(p lt ■■■,p n ,I) s IP’Xpi, p 3 ; I + piXi + pjxj - £ p k Xk) (34) 

1 

for any i not equal to j. Subject to the conditions of (25) and 
certain consistency relationships, these conditions seem to be 
sufficient as well as necessary. 
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Except for R. G. D. Allen’s derivation of equation (IS) for the 
two-good case, I am not aware that these full price-quantity impli- 
cations of independence have previously been derived . 7 However, 
fragmentary sets of necessary but by no means sufficient conditions 
for independent goods have been derived by Slutsky and by Milton 
Friedman 8 They differ fundamentally from those above, and are 
of a species which I choose to call local implications of independ- 
ence. At a given point in the ( X ) space it may be possible to find 
a transformation F(<p) such that 

F tJ =* F'<pt, + F"<p x f>, = 0. (t 7 * j) (35) 


Thus, if there are only two goods (xi) and (x 2 ), this is always 
possible at every point so that the local conditions degenerate into 
trivial identities At any point (X) = (xi, x 2 ), let 


and 


F”(<p) _ 

F'(v) Vi(X) Vi (X) ’ 
Fn(X) = 0. 


(36) 

(37) 


This differs from the previous non-local conditions in that the 
above relation is an equality at a point and not an identity 

In the general case of n goods a local transformation of this 
type is possible only if the following relations hold, 


<Pn(-X) <Px,(X) 

^(X) Vi (X) " Vt {X) <p,(X) • 


(* 1 ) 


(38) 


These are [n(n — l)/2 — 1] independent conditions, and as before 
need not be identities. Even if they hold everywhere , independence 
is not necessarily implied. If the local conditions hold, the bordered 
determinant of the type given in equation (34) of chapter v takes 
on a special form, and consequently certain restrictions upon the 
demand functions can be derived. 

Slutsky 9 shows that 

(39) 

1 R. G D Allen, “A Comparison between Different Definitions of Complementary 
and Competitive Goods," Econontelrtca, II (1934) 168-175 

1 Milton Friedman, "Professor Pigou's Method for Measuring Elasticities of Demand 
from Budgetary Data,” Quarterly Journal of Economics, November, 1935, pp 151-163 
* E. Slutsky, “Sulla teona del bilancio del consumatore,” Giornale degh Economist*, 
U (1915), 23-26 
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where 



K n 

K u 

■■■ K ln 

dxi 

~dl 


K u 

k 22 

••* K 2n 

dx 2 

~dl 

p = 

■ 

■ 

• 

* 


K ln 

K 2n 

••• K nn 

dXn 

a7 


dxi 

JT 

dx 2 

dl 

dx n 

or 

0 


and the subscripts indicate appropriate cofactors. Also 


e, = - 


d log (- X) 
dl 


<P 


in the notation of chapter v. 

If an index exists such that 


at a point, then 


Of = 


Fa = 0, (i ^ j) 


P 12 


Pa 


Pn+\,n+l.l2, Pn+\,n+\.ij 


j 1) 


l8l 


(40) 


(41) 


(42) 

(43) 


These are [ft (n — l)/2 — 1] independent and meaningful restric- 
tions on the demand functions and completely exhaust the local 
implications of the independence assumption. 

I should also like to point out that equations (43) might be used 
to determine the marginal utility of income if (1) independence is 
possible; (2) that utility index which can be written as an additive 
sum of independent utility is defined as the “true” cardinal index 
of utility. I commend it to the attention of the never lacking 
army of utility measurers. 

Mr. Friedman has derived the following n(n — 1) local inde- 
pendence conditions: 10 

Vii _ / Vil 1 — ktfii \ j?i£ j" ( kj — hi) ~ kikj{r}il — Tjj ! ) 1 _ 

Vii \ VjT 1 — kjtjji ) rjjj |_ 1 — kflji J 


In the two-dimensional case the restriction is vacuous. 


10 Friedman, op. cit., p. 162. 
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From the fact that their number is excessive, it is clear that 
they cannot all be independent. In fact, I am unable to prove 
that they are complete, i e , equivalent to the above Slutsky condi- 
tions; indeed, I would venture the conjecture that they are not 11 

It would be literally impossible for any individual to determine 
by introspection whether or not his demand functions satisfied the 
above relations Indeed, the likelihood that such relations, se- 
lected arbitrarily from an infinity of possible functional relations, 
should hold is infinitely small The minute amount of plausibility 

11 On page 162, footnote 2, Mr Friedman argues against the admissibility of the 
Pigou theorem that 

5S.S2 wta, t-o-il (.) 

Vii Vit Vil Vil 

on the grounds that “ there is a presumption that ij,/ and k { [my notation} are 
immtly related ” The whole problem is of course ambiguous until a particular set of 
indifference curves are specified Nevertheless within the realm of probability and 
presumption, Mr Friedmans statement seems to be incorrect In chapter v it is 
shown that 

£ *.w 

it. 

i e , a weighted average of all income elasticities equals unity regardless of the number 
of goods The average k is given by 



This approaches zero as the number of goods increases 

A survey of empirical budget studies will convince the reader that the income 
elasticities are distributed around unity, while the proportions vary around 1 /» 

See also A C Pigou and N Georgescu Roegen, “Marginal Utility of Money and 
Elasticities of Demand,' Quarterly Journal of Economics, vol L, no 3 (1936) 

The above Pigou theorem rests, nevertheless upon the restrictive assumption of 
independence If it could be shown that it holds for small values of in every case, 
it would be much more useful 

That this cannot be true in general is shown by the following special case Let 
consumption of n goods be completely * joint in the sense that there are always fixed 
proportions Then regardless of the number of goods >},, = !, and n,ihii = 1 0 n 
the other hand it is easily shown that ij < s — x,p,/I Consequently, — k,/k, 
Even though we make the k s become arbitrarily small as compared to the income elas 
tiaties, their ratio may take on any value other than unity Thus, the Pigou theorem 
(») is false 

Of course, this example rests upon discontinuities in the higher derivatives, but I 
should think that we could approximate this condition by appropriate choice of con 
tinuous derivatives 
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to the proposition that independence is an admissible assumption 
comes from the fact that the question is usually posed in such a 
way that independence appears to be an intermediate class between 
the extreme classes of competitiveness and complementarity. Em- 
ploying a rudimentary concept of the equal probability of the 
unknown, one is inclined at first blush to acquiesce to the inde- 
pendence assumption. The error involved in so doing is obvious 
from our previous discussion, completely aside from the crucial 
ambiguity attaching to the older notions of independence and 
complementarity . 12 

Complementarity 


In my opinion the problem of complementarity has received 
more attention than is merited by its intrinsic importance. Never- 
theless, as a result of the interest in this subject, crucial incon- 
sistencies in the thought of Pareto were revealed by Hicks and 
Allen, and much light was thrown on the cardinal and ordinal 
conceptions of utility. 

The older writers, Fisher, Pareto, and Edgeworth, suggested 
as a qualitative definition of complementarity between two goods, 
Xi and Xj, the sign of the cross derivative of the utility function; 
i.e., goods were complementary, independent, or competitive de- 
pending upon whether 


dV ^ 

dXidXj ^ 


0 . 


(45) 


If one assumes only an ordinal preference field, all numerical utility 
indexes are equally admissible. This cross derivative is not in- 
variant in sign under a change in the index of utility. Consider 
a monotonic transformation of <p. 


U = F(<p), £ > 0 

rr _dF 

Ui ~ dp 9i ' 


(46) 


TT dF 1 

U « = Tp 9ii + 


d 2 F 

d^ 


12 It may be argued that regarded purely as a working hypothesis the facts do not 
sharply contradict the independence assumption. A little investigation reveals that 
such a hypothesis has not been tested from this point of view. On the contrary, it is 
implicitly assumed from the beginning in the manipulation of the statistical data. 
Hence, one would have to go back to examine the original empirical data. It is interest- 
ing to note that observations on three expenditure paths would be sufficient to contra' 
the independence assumption in the case of two commodities. 
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By a proper selection of the arbitrary terra, cPFfdy 2 , the sign of 
the new cross derivative can be made to differ from that of the old 
As a corollary of this lack of invariance, complementarity so defined 
will have nothing to do with the budgetary habits of individuals 
with respect to the two goods in question. Similarly, an indi 
vidual’s introspective, spontaneous, intuitive designation of two 
goods as complementary or competitive (sugar and coffee, beef or 
pork, etc ) corresponds not to such a measure as this, but rather 
to behavioristic properties of the preference field and demand 
functions 15 

In the course of the last five years the world’s best economists 
have spent considerable time and energy in the study of Professor 
Hicks’s Value and Capital 14 It is perhaps symptomatic of the 
essential unimportance of the concept of complementarity that in 
this period no one seems to have noticed that the author gives two 
or more distinct (and inconsistent?) definitions of complementarity. 
The verbal definition in the text (p 44) seems to be different from 
that in the mathematical appendix (paragraph 8) 1S 


13 Hicks and Allen, Slutsky, and Schultz have suggested invariant measures of 
complementarity which are properties of the indifference curve system and demand 
function Perhaps the simplest measure of complementarity between two goods Xi 
and x„ is the sign of 




K<,Pt 

x< 


If there are but two commodities, this must always be positive m sign, in the many 
commodity case at least one must be of positive sign in order to satisfy the relationships 


E p,Ki, =* 0, 


(» = 1, , «) 


K„ <0 

Professor Leontief has suggested the following invariant measure of independence, 
corresponding most closely to the old notions 

a* ,o g( — ) 
d*‘R- _ ^\U,/ _ 

dx,dxi dXtdx, 

Since the above lines w ere first written Professor Hicks s two works. Value and 
Capital and Theorte Mathemattque de La Valeur, have appeared I still believe, however, 

that Hicks’s own solid contributions to economic theory do not rest on his treatment of 
complementarity, and that the extended discussion of the concept is a tribute to an old 
love rather than the necessary consequence of the subject matter 
"J R. Hicks, Value and Capital (Oxford, 1939) 

11 Actually, there is still a third definition in the footnote on page 44 Only xn the 
case of three goods is it clearly equivalent to one of the other two, in the general case it 
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This may be seen in many ways. While the mathematical 
definition can be applied to the case where there are only two goods, 
the literary definition cannot. (It is irrelevant that in the case of 
two goods the mathematical definition of complementarity admits 
of only one possible algebraic sense.) There is no reason why two 
distinct definitions should give the same answer in any particular 
case, so it is not surprising that one can invent examples ad inifini- 
tum for which two goods, such as wheat and linen, are on one of 
the definitions complements and on the other substitutes. But as 
we shall see in a moment, things are in even a worse state. Ac- 
cording to the definition of the literary text, it is possible for wheat 
and linen to be complements and substitutes at the same time, 
depending on the selection of the third good which is to serve as 
numeraire. The definition is ambiguous; instead of reflecting the 
properties of two goods, it (or rather they) represents the properties 
of three goods. 

The mathematical appendix definition gives as a coefficient of 
complementarity between the ith. and jth good the element of the 
ith row and jt h column of the matrix (^/) where 

x = HP, U) (47) 

represents in matrix notation the demand functions, for each 
indifference level. 

Because of the homogeneity property discussed in chapter v it 
is clear that these n demand functions cannot be inverted to give 
each p in terms of the x’s and U. Only relative prices can be 
determined. We can use any one price as numeraire, discard one 
of the above equations and invert the others so as to give ( n — 1) 
relative prices in terms of ( n — 1) goods and the ordinal level of U . 
It is most convenient to omit the quantity of the numeraire, x k , 
and its demand equation. This gives us 

—■ — h N i (x 1 , • • • , Xk- 1 , x k +i, ••• , x n , U ). (i n) (48) 

pk 

is ambiguously worded, since the offer of the maximum amount of money by the con- 
sumer to attain given amounts of the two goods in question will change the amount 
bought of all other goods. It would appear that the author intends that all other (than 
the two goods in question and the numeraire good) goods are to be held constant, in 
which case it becomes identical with the verbal definition. The difficulty may reside in 
Hicks’s unique use of the concept ‘‘money.” 



1 86 FOUNDATIONS OF ECONOMIC ANALYSIS 

Let (*2V/) = Nk be the matrix of this transformation Then the 
element corresponding to the ith price and jth quantity represents 
the literary definition of complementarity It can easily be shown 
that this, like the first matrix, must be symmetrical In fact, it 
is necessarily negative definite, like the ^ matrix 

These last two statements are consequences of the definition of 
the inverse demand functions according to which 

Nk = tk-'iPC'K), (49) 

where ypk is defined to be the matrix formed by omitting the kth 
row and &th column from the original ^ matrix Since it is known 
from earlier chapters to be negative definite, so too will be its 
inverse 

While the diagonal elements of the N and ^ matrices must agree 
in sign, being negative, the off dtagonal terms need not agree in 
sign except in the empirically unimportant case of three goods 
Even more important, the element in a given row and column of 
the N matrix may be of different sign depending upon which good 
is called money or numeraire A single instance will suffice to 
indicate this possibility Suppose we have n goods in a preference 
field that can be typified by additive utilities Let all but the last 
good have decreasing marginal utility Then on the literary defini 
tion the first two goods will be substitutes if the third good is used 
as numeraire, unless the last good has decreasing marginal utility, 
when it is used as numcratre, the first two goods will be comple 
ments There is no reason why one of the goods, which we may 
call the last, cannot be an inferior good Indeed, in the interesting 
borderline case where one of the goods has constant marginal 
utility, every pair of goods is independent on the literary definition 
when the good with constant marginal utility is used as numeraire 
if any other good is used as numeraire, all other pairs not involving 
the good with constant marginal utility are easily shown to be sub 
stitutes But according to the definition of the mathematical ap 
pendix, all pairs of goods not involving the one with constant mar 
ginal utility must always be independent, whatever the numeraire 

A typical element of Nk can be written out in terms of the slopes 
of the indifference loci, and the reader may verify for himself the 
,o pendence upon the subscript k 
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Walras was scrupulously careful in his use of the concept 
numeraire. He distinguished between it and money, in the sense 
of the portable, divisible, cognizable, durable, conventional com- 
modity which served as a counter in performing exchanges. 
Marshall, when he spoke of money, ordinarily meant nothing more 
than income, as we shall see in the discussion of the constancy of 
the marginal utility of income. Hicks seems to alternate between 
the above uses, and a third sense in which money stands for a 
composite commodity made up of all but one or two of the total 
number of commodities. The result is a tendency toward am- 
biguity, of which the above example is only one instance. 

If complementarity is not of interest for its own sake, can it not 
still be of importance as an indicator of conditions where certain 
“abnormal” phenomena are to be found? Thus, on page 71 in his 
discussion of the stability of a general equilibrium exchange system, 
Hicks says, “If income effects can be neglected, and if no comple- 
mentarity is present, then the system of exchange equilibrium must 
be stable.” It would seem, therefore, that the concept may be 
useful in indicating where a stable system will break down. Un- 
fortunately, this is not correct. The author has made a momentary 
slip, as he has since indicated elsewhere. If income effects can be 
neglected, then his matrix (X„), being the sum of the negative 
definite symmetrical matrices of all individuals, must itself be 
negative definite and symmetrical. It must therefore be perfectly 
stable on the Hicksian definition, regardless of complementarity . 16 
If income effects cannot be neglected, then the matrix may be 
asymmetrical; according to the author asymmetry tends to make 
for instability. This is not quite correct. Pure asymmetry, and 
nothing else^ tends to make for stability; the neglected income 
effects are the villains in the piece, and they do the most harm when 
their influence is not spent in creating asymmetry. 

Again in Hicks’ discussion of the effects upon the “period of 
production” of a change in the interest rate, the concept of com- 
plementarity is introduced. 1 ' But here, too, a correct rendition of 
the secondary maximum conditions shows that complementarity 

lc Since these lines were first written, I find that Hicks has detected his error. Cf. 
J. R. Hicks, “Consumers’ Surplus and Index Numbers,” Review of Economic Studies, 
vol. IX, no. 2, p. 133, n. 2. 

17 Hicks, Value and Capital, p. 222, n. 1 and p. 328. 
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cannot be such as to make the Hicksian period of production de- 
crease with a reduced interest rate. 

The Hicksian average period of production is defined as the 
elasticity 18 of discounted value, C, with respect to changes in the 
discount rate, 0 = (1 + *) -1 - Mathematically, it is given by 
(0dC/d0)/C. Its precise usefulness seems not to have been ex- 
plicitly indicated. Among other things, the Hicksian “average 
period” has the anomalous property of not being an “internal 
mean,” i e , one whose value lies within the limits of the greatest 
and least of the time periods being averaged. Thus, it cannot be 
considered a generalization of the simple Bohtn-Bawerkian average 
period of production. For example, consider the trivially simple 
“point-input-point-output” case where 99 £ worth of input is in- 
vested for one year to produce, say, $1.00 worth of output. The 
Bohm-Baw erkian average period t which unlike the Hicks concept 
requires a careful distinguishing between plus and minus items, is 
one year regardless of the interest rate. The Hicks period is 100 
years if the interest rate is zero and infinite if the interest rate is 
one per cent In fact, for the so-called “marginal investment” it 
is always infinite. By introducing discount factors into the aver- 
age, the author hoped, perhaps, to meet the Knightian objection 
that the period is infinite, but in many cases he seems only to have 
succeeded in making the finite infinite 19 

But this is not the place to go into the deeper Knightian objec- 
tions to the Austrian theory. It will be sufficient, in passing, to 
state what seems to be the only essential theorem relating produc- 
tion planning to the interest rate. By the methods of earlier 
chapters, it can be easily shown that where the firm acts to maximize 
C = X Q + *10! + *202 + + x n 0 n , (50) 

*• As its author indicates, here is an example of an elasticity expression which is not 
dimensionless For an analytic explanation of this the reader is referred to the first 
section of my chapter vi 

** Actually, it is not always necessary to introduce discount factors in order to make 
a process with infinite range yield a finite average Many n riters have shown how the 
distant elements receive smaller weights so as to create a convergent infinite series 
Thus, imagine a perpetual stew, to which something is always being added and from 
which something is always being taken out, at random Some part of what is now being 
added wilt never come out of the stew, just as some part of what is in the stew is of 
infinite age But it is a simple exercise in infinite processes to show that the average 
age of the stew is finite, and the average expectancy of a particle’s staying in the stew 
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a change in the discount factors must see the following inequalities 
realized 


E AxjApj ^ 0. (51) 

If the interest rates for all periods are equal, and if we go to the 
limit, we have the definite theorem 

E (dxj/dfijfi- 1 g; 0. (52) 

Constancy of the Marginal Utility of Income 

I return after the above digression to an examination of another 
special restrictive assumption with which the analysis of con- 
sumer’s behavior is often burdened. For its own sake the problem 
of constancy of the marginal utility of income is one of the least 
important in economic theory, but it has given rise to an endless 
amount of discussion, most of it not on a very high level, and it 
therefore deserves brief notice here. Inasmuch as I have given a 
rather complete analysis of this matter elsewhere, 20 1 do not propose 
to do more than summarize matters here, and to comment upon 
some more recent contributions which became available after the 
writing of the cited essay. 21 

As will be seen in the next section, constancy of the marginal 
utility of income derives most of its importance from its relation- 
ships, real and alleged, to the subject of consumer’s surplus. In 
the beginning, it is well to point out that one ambiguity runs 
through the whole literature; sometimes constancy is interpreted 
to mean constancy, and at other times it is intended to mean 
almost constancy. The last usage generally involves some sort of 
limiting process; either the change in question is supposed to be 
“small,” whatever that might mean, or the percentage spent upon 

10 P. A. Samuelson, "Constancy of the Marginal Utility of Income," Studies in 
Mathematical Economics and Econometrics: In Memory of Henry Schultz (University of 
Chicago, 1942), pp. 75-91. 

21 A. Henderson, "Consumer’s Surplus and the Compensating Variation,” Review of 
Economic Studies, vol. VIII, no. 2 (February, 1941), pp. 117-121. J. R. Hicks, “The 
Rehabilitation of Consumers’ Surplus,” same issue as above, pp. 108-116. J. R. Hicks, 
"Consumers’ Surplus and Index Numbers,” Review of Economic Studies, vol. IX, no. 2, 
pp. 126-137. Robert L. Bishop, “Consumer’s Surplus and Cardinal Utility,” Quarterly 
Journal of Economics, vol. LVII, no. 3 (May, 1943), pp. 421-449. 
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some specified good is to be small etc , etc The problem is still 
more complicated because the alleged result, itself rarely worded 
unambiguously, is more often than not held to be "likety’ rather 
than necessary “ I think, therefore that we shall accomplish more 
if in the beginning we examine the literal implications of constancy 
of the marginal utility of income to show rigorously the empirical 
implications of this hypothesis 

\Ve have already encountered the marginal utility of income in 
chapter v in the form of the — X of equation (31) It is clear from 
the preceding equilibrium equations (29) that the marginal utility 
of expenditure upon each and every commodity must be the same, 
equal moreover to the marginal (rate of) utility derived from an 
extra dollar of expenditure distributed optimally over all of the 
goods 13 

I have been meticulous in adhering to the usage of marginal 
utility of income , eschewing the more common terminology en 
countered in the literature of the marginal utility of money This 
brings up the second fatal ambiguity involved in the discussions of 
this subject Money has many different meanings in theoretical 
discussions, ranging from an abstract non metallic, non paper, con 
ventional unit of account to specific counters, to commodities 
commanding wide acceptance in exchange, to any commodity taken 
arbitrarily as numeraire, to income or expenditure It is reason 
ably clear from everything that Marshall has written and from 
the cast of his thought that he definitely intended to convey the 
meaning of money simply as a euphemism for income or expcndi 
ture, reckoned in pounds or dollars In his own words, it is "money 
or general purchasing power ” 14 In particular, it was foreign to 
his usage to think of money m the sense of numeraire, a concept 
which is not even listed in his index 

This is a matter of importance when we come to ask the ques 
Vic/Ti, ’W/Jri respect \t> ~w'rrat 'is Vrit rrnngnm 1 ! atririty ifi TTitt/me xn 

II This is another one of the numerous ptaces where Alfred Marshall left matters in 
a fog It was part of the flavor of the man not to bring things to a sharp focus But 
what is forgivable in a genius oannot be tolerated among lesser mortals 

“This is simply one example of the Wong Viner theorem discussed in chapter 
il p 34 chapter iv p 66 

w A Marshall Principles of Economics 8th edition p 838 For a more complete 

assion and for detailed citations the reader ts referred to my essay m the Schultz 
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money to be held constant? That Pareto might have interpreted 
this to mean constant marginal utility of a particular commodity 
chosen as numeraire , taken with respect to changes in the amount 
of that commodity, is understandable in view of the Lausanne 
usage. But that Hicks and others in the Anglo-Saxon tradition 
should have thought: this to be the Marshallian meaning is much 
more surprising. iN ow at this date it is not a very significant ques- 
tion as to what Marshall himself really meant, but it is of impor- 
tance to show the implications of at least two distinct and incon- 
sistent meanings. Furthermore, many writers have definitely 
fallen into misstatements concerning the necessity and sufficiency 
of parallel indifference curves (the geometrical embodiment of the 
first of the above formulations) if various identities are to be 
realized. 25 

A closer examination of equations (29) and (31) will show that 
the marginal utility of income cannot be constant with respect to 
everything. It is a function of all prices and income. If we were 
to double all prices and income, the marginal utility of income would 
have to become halved. For such a doubling of income and prices 
would by the homogeneity property described in chapter v leave 
all physical quantities unchanged, and hence all real marginal 
utilities unchanged. But the marginal utility of income is given 
by dividing each real marginal utility by the respective price. 
With the numerator unchanged, and with the denominator doubled, 
the whole expression must be halved. If the marginal utility of 
income were constant with respect to all prices and also with 
respect to income, it would have to be unaffected by a simultaneous 
change in them all ; since it must be halved by such a change, we 
have a contradiction. Consequently, the marginal utility of in- 
come cannot be invariant under changes in income and each and 
every price. 26 

At most the marginal utility of income might be independent 
of all but one of these (n + 1) magnitudes. We can set n first 
partial derivatives equal to zero, but not (n 1). Which n shall 

56 In addition there may be a misapprehension running through Hicks's treatment 
in his book and the two articles cited. As Bishop has pointed out, the problem is not 
primarily one of whether or not income effects can be neglected. Further, there is the 
question of keeping real income constant or “adjusting for changes in real income as 
seems to be implied in certain of the Hicksian statements. 

25 See Studies in Mathematical Economics and Econometrics, p. 76. 
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we choose ? This can clearly be done in (» + 1) alternative ways 
One of these involves the constancy of the marginal utility of in 
come with respect to n prices, but not with respect to income 
Elsewhere I have argued that this is the pure Marshallian case 
The other alternatives involve the constancy of marginal utility 
of income with respect to income and with respect to (» — 1) 
prices or w ith respect to all prices but one 27 This is the second 
hypothesis of constancy 

In the Schultz volume essay I worked out the implications of 
each of these distinct hypotheses In terms of everything that 
has ever been observed concerning price and budgetary data the 
implications of each are highly unrealistic, although it is the second 
of these which leads to really fantastic conclusions 

Before summarizing these empirical implications I should first 
point out that the marginal utility of income, being from a formal 
point of view a Lagrangean multiplier, must by the results of 
chapter vi, p 132, not be left unchanged if we go from one utility 
index to another, as is our privilege in an ordinal preference field 
At each point, for each different utility indicator, we have a differ 
ent marginal utility of income If the expression is constant for a 
given movement when we are using one indicator, it will not be 
constant if we use another indicator If, to get rid of the minus 
sign we define a new expression to be equal to —X, it is really 
necessary to indicate the cardinal index of utility to which it applies 
We may represent the new expression by the letter m, and append 
a subscript to indicate the utility index m question 
Then, 

m, = p ,, , pH, 1) = - X - !(pi, p, , p„ 1 ) (53) 

As shown in the Schultz volume this function must be homogeneous 
of order minus one , by Euler’s theorem on homogeneous functions 
the sum of its elasticities with respect to each of the independent 
variables must equal minus one, so that each cannot vanish 

* T The good whose price changes have an effect upon the marginal utility of income 
need not be the commodity selected as numeraire and it might prevent confusion if we 
made a point ol selecting some other commodity as money especially since there is 
no special reason within the scope of static theory why this special good should be 
ivis ble durable and have all the other properties of money Nevertheless in the 
mture the convention 13 followed of picking this good as numeraire 
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If we subject our utility index, <p, to a monotonic transformation 
to give us a new index, U = F(<p), in terms of our new utility 
index the marginal utility of income has the properties 


m v = F'((p)m v , 


dmu 

da 



I E-// d<P 
+ F 


( 54 ) 


Since we can make F" of any sign, the marginal utility of income 
can be made to change in any direction and amount we like, except 
in the special case which I have discussed elsewhere. 

Have we destroyed then all possibilities of speaking of constant 
marginal utility of income? Not quite, since we need only assume 
that there exists some one cardinal index (even if we ourselves pre- 
fer to use another cardinal index, or no one cardinal index) for 
which the strong, non-local restrictions of the two different hy- 
potheses hold. 

It is a mere exercise to show that the first, or pure Marshallian, 
hypothesis implies that the income elasticity of demand for each 
good must be unitary, and the price elasticity of demand for each 
good in terms of its own price must equal minus one. Moreover, 
the demand for each good is independent of changes in the prices 
of all other goods. There must exist one way of numbering the 
indifference curves so that utilities are additive and independent 
in the old sense ; actually except for arbitrary origin and scale the 
utility function is a linear combination of terms like log x,-, 
where the k coefficients represent the unchanging proportions spent 
upon each commodity, x. These implications are both necessary 
and sufficient. 

In view of the well-known Engel's laws and numerous budgetary 
studies it is hardly necessary to point out that this flies in the face 
of all observable reality, even as a first approximation in the sense 
of limitingly small changes. 

Performing a scarcely more difficult exercise in connection with 
the second hypothesis, it can be shown that it implies that the 
demand for each good except the numeraire depends only upon its 
own price in relation to the numeraire’s price, and is entirely 
independent of income, money or real. The demand for the 
numeraire depends in a corresponding way upon all prices and 
income. These conditions are both necessary and sufficient. 
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They imply among other things that the indifference varieties are 
parallel, their slopes being constant along the direction of change 
in the numeraire good alone. 

Of course no one has observed, and presumably no one ever 
will, a preference pattern in which all of extra income is spent upon 
one commodity . 58 Note that this is not even approximately true 
for instantaneous rates of change even when we neglect “second 
order of smalls.” 

Before we go into the reasons why anyone should ever have 
wished the marginal utility of income to be constant, a word may 
be said concerning the quasi-constancy of this magnitude. Of 
course, for a very small change in the initial situation, the marginal 
utility of income will change very little, as with everything else in 
the system. But this is simply a consequence of continuity, and 
nothing else. It is quite another thing to say, and this is the 
only important thing, that the rate of change of marginal utility of 
income is small. Today the use of infinitesimals and of differ- 
entials is rather out of fashion with mathematicians, although a 
completely rigorous basis can be given for these processes. But 
in the Victorian era when Marshall was in his prime, these were 
old familiar standbys, and we may be sure that the man who was 
second wrangler to the great Lord Rayleigh was well at home with 
their use. It is strange, therefore, that lie contented himself with 
gratuitous, and I believe incorrect, statements that changes in the 
marginal utility of income were of the “second order of smallness.” 
In the technical sense they are not ; the fact that we are dealing with 
what can be regarded as a second derivative is not relevant here. 

If it may be of comfort to anyone cherishing a fondness for 
constancy, the following formula may be given indicating how the 
relative amount of expenditure upon a good ties up with the elas- 
ticity of the marginal utility of income with respect to a change in 

** Perhaps King Midas was an exception, although his case involves certain dynamic 
considerations outside the present scope. For a one commodity maniac of his type a 
might not be inappropriate to measure all values in terms of the commodity in question: 
thus, the food necessary to keep a book collector alive might be reckoned as simply some 
fraction of a first edition of Adam Smith. 

11 Marshall, Principles, p. 842. The small germ of truth in the argument concerning 
"second order of smallness” lies in the fact that if each of the half dozen different 
"msumer's surplus concepts are plotted againts a variable price (later called pf), then 
the original point, pf, they all have the common tangent, — 5 ,. 
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the price of this good. 


Em v _ / Em v Exi \ 

EP<~ ki \~W + El)' 


( 55 ) 


Providing that the two expressions in brackets do not simultane- 
ously become large, and the-last of these must for all goods average 
out to unity, then our left-hand elasticity must go to zero in the 
limit as the per cent spent upon the good in question goes to zero. 
But then of course any “surpluses” connected with this good 
become small, and there is no reason to distinguish between the 
orders of smallness. 


Why Consumer’s Surplus is Superfluous 

Of course, the problem of constancy derives its interest, I will 
not say importance, from the Marshallian concept of consumer’s 
surplus, about which an earlier generation of economists were able 
to indulge in much argumentation. Since many of the points in- 
volved were essentially mathematical in character, and since most 
of the antagonists did not go beyond literary methods, the discus- 
sion was not able to advance beyond a certain point, although most 
of the essential difficulties with the concept had been brought out. 
Later when there came about a renascence of mathematical meth- 
ods in economic theory, economists had lost interest in the problem, 
and the subject lived on primarily in the elementary textbook and 
in the classroom. 

In the present writer’s opinion this is as it should be. The 
subject is of historical and doctrinal interest, with a limited amount 
of appeal as a purely mathematical puzzle. These statements arc 
made in cognizance of the fact that Professor Hicks has recently 
attempted to rehabilitate the doctrine of consumer’s surplus. It 
would carry us afield from the present task to analyze his conten- 
tions in detail, and so only a few ex cathedra remarks must suffice. 

In the first place, any judgment as to the usefulness or lack of 
usefulness of consumer’s surplus has nothing to do with the problem 
of the admissibility of welfare economics as a significant part of 
economic theory since nobody has ever argued that the latter sub- 
ject presupposes the validity of consumer’s surplus. Can it then 
be said that consumer’s surplus if not necessary, is nevertheless a 
useful construct? Concerning this psychological question, no ■ 
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answer can be given. Historically the important propositions 
concerning increasing and decreasing cost industries, which are 
attributed to Marshall’s consumer’s surplus notions, may be said 
at best to have been incomplete derivations, and at worst may be 
said to be absolutely incorrect statements which, by a pun or play 
on words, seem to resemble the Pigouvian doctrine concerning in- 
dustries with external economies and diseconomies. In its earlier 
form the Pigouvian doctrine is close to that of Marshall, but from 
the writings of Knight and Pigou himself we know that earlier form 
to be quite wrong . 80 To Pigou the problem is not at all one of 
increasing or decreasing returns, it is only a question as to whether 
each unit is taking account of its full effect upon social magnitudes 
(other than prices), or whether it is not. If it is not, and that is all 
that we mean by external economies, then there is ol course need 
to interfere with the "invisible hand.” { have found nothing in 
the written work of Marshall which suggests that he ever saw 
matters in this way, and even if he had stumbled upon this result 
by means of consumer's surplus, it would not be the first time that 
a correct theorem had been reached by incorrect, heuristic reasoning 
Also it may be said that the merits or demerits of the concept 
in question have little to do with the applicability of partial equi- 
librium methods to any particular problem. As for its connection 
with the theory of index numbers, after the concept has been 
renovated and altered, it is simply the economic theory of index 
numbers in the Pigou, Konus, Haberler, Staehle, Leontief, Lerner, 
Allen, Frisch, Wald tradition 11 

,a F H Knight, "Fallacies m the Interpretation of Social Cobt,” Quarterly Journal 
of Economics, XXXVIII (1924), 582-606 A C Pigou, The Economics of Welfare, 
4th edition (London 1932), chap, xi and appendix in 

11 In the Hicks derivations (Rmno of Economic Studies, vol IX, no 2, pp 126-137) 
certain well known theorems w hich are exact are derived as approximations Also his 
most interesting result, that the difference between the Laspeyre and compensating 
variation is equal to a generalized substitution term, is exactly true, not simply for small 
movements, being a transcription of the familiar notion that the two terms differ m 
consequence of the curvature of the preference field. The one application to welfare 
economics in section 8 would be of interest only in connection with a (misdirected) 
attempt to measure welfare in a cardinal sense to say whether one movement is better 
than the sum of the benefits of two other movements And even if one were interested 
in cardinalization of welfare, this would not be the way to do it, for it can be shown that 
the value sums which are used in index number theory are of importance only for the 
itative direction of change which they indicate, in general (except in the trivial case 
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If one were to begin afresh to give answers to the following 
problems, in no one of them would consumer’s surplus be necessary 
or desirable : Should Robinson Crusoe, a Socialist state, or a capital- 
ist economy build a particular bridge? Should indirect taxes be 
preferred to direct taxes? Should discriminatory prices be allowed 
if a uniform price will not keep an activity in business? Should the 
number of firms producing differentiated products be reduced, and 
in what way? Should a particular small industry be expanded or 
contracted by means of tax or subsidy? etc., etc. Aside from their 
extraneous inter-personal aspects, all of these questions can more 
conveniently (and more honestly !) be answered in terms of the con- 
sumer’s ordinal preference field. 

It is for these reasons that my ideal Principles would not include 
consumer’s surplus in the chapter on welfare economics except 
possibly in a footnote, although in my perfect Primer the concept 
might have a limited place, provided its antidote and alternatives 
were included close at hand. 

The Many Forms of Consumer’s Surplus 

The Marshallian expression of consumer’s surplus does not refer 
to any one thing but to at least half a dozen interrelated expres- 
sions. It is only too easy to accept tacitly one of these as primary 
and then to show that the others do not correctly measure this 
magnitude. Thus, Professor Viner 32 argues that Marshall is in- 
correct in using the area under the demand curve as an index of 
the gains from trade because this magnitude does not coincide with 
the amount which could be derived by an all or none offer, as if 

of expenditure proportionality) they cannot constitute even an arbitrary cardinal index. 
Some of these implications will be seen from an application of the Hicks result to what 
I have called the pure Marshallian case. Let us suppose that utility really is measur- 
able in a cardinal sense and is given by the additive logarithmic form mentioned above, 
so that demand for each good is unitary and independent of all other prices. Any two 
goods will nevertheless be substitutes in the Hicksian sense; if it seems strange that 
independence in the usual objective sense nevertheless implies substitutability in the 
latter sense, the reader is reminded of the formal definitions laid down by Hicks. Be 
cause each good is unaffected by a change in the other price, a change in both prices 
together leads to exactly the same cardinal change in utility as the sum of each change 
separately. Yet by the Hicks theorem of section 8 the combined reduction in price leads 
to a smaller “gain” than the sum of the two separate "gains,” an eminently gratuitous 
conclusion. 

52 J. Viner, Studies in the Theory of International Trade (New York: 1937), chapter ix, 
section iv. 
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the Jatter were the primary and correct expression for consumers 
surplus As Bishop has shown,” the tatter may be a worse ap 
proximation than the former to the cardinal utility gam which 
Bishop takes as the primary measure of consumers surplus 
Hicks first took the all or none magnitude as being the primary one 
but now adopts a fourth magnitude as primary, what he calls the 
compensating variation Which is correct can only be a question 
of history of thought, which would Marshall have selected if con 
fronted with a one or none offer? Unfortunately this is no longer 
an observably meaningful question Although I think Marshall 
would have agreed with Bishop, looking upon the other measures 
as good or bad approximations, 1 prefer to treat no one as primary 
or privileged, but to give the relationships between each 

This was done in brief but fairly exhaustive fashion in the 
Schultz \olume essay, from which I append the following few pages 
copied verbatim except for minor changes 34 

Before examining the Marshallian concept of consumer s sur 
plus let us consider the uses to which it is put Among other 
things it is proposed as a measure of the gam (loss) of utility that 
results from a decrease (increase) in price of a single good An 
attempt also has been made to apply it to the analysis of the burden 
involved m commodity taxation It has been used to determine 
the maximum amount of revenue that a perfectly discriminating 
monopolist might exact from the consumer for a given amount of 
the good in question 

Since only an ordinal preference field is assumed in the theory 
of consumer s behav lor, there is really little importance to be at 
tached to any numerical measure of the gams from a price change 
In particular, one cannot fruitfully compare the gam derived from 
a movement between two given price situations with the gatn be 
tween two other price situations 35 Moreover, all valid theorems 
relating to the burden of taxation can be stated independent^ of 
any numerical measure of utility change We should not be greatly 
perturbed therefore, if the concept of consumer’s surplus should 

** Op ct t p 421 passim 
11 Pages 87-90 

*' O n e can how ever compare the gains derived from a change m the basic pnee 
situation with an alternative price change from the same basic situation since th s 
resolves itself into an ord nal comparison of the alternate e new situations The mit al 
•lation cancels out so to speak 
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be found to be inadmissible. Its only advantage seems to lie in 
its deceptively easy two-dimensional representation. 

Consider an initial price and income situation, (pc, • • • , p n a , I a ), 
and the corresponding amount of goods purchased, (xi°, • • •, xj). 
For any selected utility index, <p, there will also be a given amount 
of utility, <p(X a ). Suppose that a change is made in but one price, 
p it and income is left unchanged. There will be new amounts of 
every commodity, (xp, ■ • -,x n b ), and of utility, <p(X b ), correspond- 
ing to the new prices and income, (pp, • • • , p n b , P), or (pc, • ■ 
pp, ' ’ ‘ > Pn a , I a ), where pi 3 is less than pc. 

We are interested in the following magnitudes: 

1. The gain (loss) in utility resulting from the price change, or 

<p(X b ) - <p(X»). 

2. The area between the demand curve of the ith. good and the 
pi axis within the range of the price movement, or 



3. The amount by which the expenditure on the ith good in 
the new situation is exceeded by the maximum amount of money 
which the consumer would be willing to pay for xp in preference 
to trading at the old set of prices. (This may be negative if we 
are dealing with a price increase rather than a decrease.) Call 
this b E ab . 

4. The amount of extra income which the consumer would in- 
sist upon if he is to be as well off as in the new situation while 
consuming the old amount of x,-. Call this a E a b. 

5. The change in income which will make trading at the new 
set of prices as attractive as trading at the old set of prices with 
the initial income. Call this b A I ab . 

(). The change in income which will make trading at the old set 
of prices as attractive as trading at the new set of prices with the 
initial income. Call this a A/ a &- 36 

According to the Marshallian doctrine of consumer’s surplus, 
all six of these magnitudes are equal except for dimensional con- 

36 Interchanging subscripts changes algebraic signs. Thus, 

b Alcb = - b Alab. “A lab, but not b Alcb nor b E a can exceed I. 
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stants We are explicitly warned, however, that his doctrine holds 
unqualifiedly only when the marginal utility of income is constant 
and only if utilities are independent I shall now examine the 
value of each of these magnitudes in four cases (a) in the general 
unrestricted case of stable demand, (b) under the first interpreta 
turn of constancy of the marginal utility of income, (c) under the 
second hypothesis when the tth good is not the numeraire, and 
(d) under the second hypothesis when the :th good itself has con 
stant marginal utility of income Only the most sketchy proofs 


will be indicated 

In the general case we have the following relations 

*(*> - *«*■> - f,\w) = r ? (sio ^ 

= i. m ’ ~~£. m - xdp ‘ (56) 

6 A/„<, = max (53 p b x, b — 53 P) b x>), where <p(X) = ip(X a ), (57) 

i i 

^ £pW- (58) 

S i (PS - pj)x, m (59) 

If only the ith price changes this becomes 

6 A/ a > ^ (ps - p?)xs (60) 

Similarly, 

■*A7 0& = min (53 PSx, — 53 PS X S) where <p(X) = <p(X b ) (61) 

i i 

•AU s £ p,-(x? - x,-), (62) 

l 

and, the ith price alone changing, 

*A/„5 S 53 pS(xf — xs) = (pS — pi b )x t b (63) 

i 
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Although this is not the Marshallian interpretation, consumer’s sur- 
plus seems to be most justified in this case. However, the above 
equalities cannot hold simultaneously for every good. 10 

For case (d) equation (67) must hold with the possible exception 
of the inequality referred to in a previous footnote, but (69) is 
definitely false. 

Other results may lie extended in at least two directions. More 
than one price may be permitted to change, and also income, with- 
out changing many of the inequalities 41 This may be left to the 
interested reader as an exercise. Also we may attempt to derive 
rigorous inequalities in the n commodity case. This offers con- 
siderable complexities in connection with concepts (3) and (4). 

49 Case (c) is sufficient to insure the equalities of equation (69) Some of them may 
hold in other cases 

41 In general, line integrals will replace simple integrals with the path of integration 
of the former a matter of no consequence 



CHAPTER VIII 
WELFARE ECONOMICS 

Beginning as it did in the writings of philosophers, theologians, 
pamphleteers, special pleaders, and reformers, economics has al- 
ways been concerned with problems of public policy and welfare. 
And at least from the time of the physiocrats and Adam Smith 
there has never been absent from the main body of economic litera- 
ture the feeling that in some sense perfect competition represented 
an optimal situation. Of course, over time the exact form of this 
doctrine has undergone modification (not always in any one direc- 
tion), and there is considerable diversity in the attempted proofs 
(in the amazingly few places where rigorous proof was attempted). 

Although this doctrine is often thought to be conservative or 
reactionary in its implication and to reflect the “kept” status of 
the economist, it is important to emphasize that it was “radical” 
in the eighteenth century, and there is some evidence from events 
of the last decades (e.g., the T.N.E.C. and economists’ role and 
views with respect to Anti-Trust) that it has become a thorn in 
the side of what are usually thought of as conservative interests. 
Furthermore, some Socialist writers, who in their youth became 
interested in analytical economics, find in this doctrine a possible 
device for expediting planning in a socialized state. 

Early uncritical allegiance to this doctrine arose in part from 
the understandable eighteenth-century tendency to find teleological 
significance in the workings of what is after all an equilibrium sys- 
tem which is not devoid of aesthetic content regarded simply as a 
mechanism . 1 But it would be unfair to the older economists to 
believe that their case ended with a simple argument from design, 

1 It would be out of place here to discuss the relationship of this doctrine to that of 
"natural rights”; to that of competition as an immutable law with which man cannot 
interfere even if he should wish to; to the Inverted doctrine of natural selection whereby 
the results of competition were judged to be best by means of a circular definition of the 
“fittest” as those who survive; to the Malthusian view that hardship and competition 
are necessary to bring out the “best” in a man; to the view that competition was good 
enough for our predecessors and therefore good enough for us; and * -.nts 

designed to preserve the status quo. 
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foundations of economic ana LI sis 
even if such a charge can be sustained with respect to certain 
Epigoni 

This can best be seen in the writings on International Trade 
where the issue of the tariff brings out most clearly the welfare and 
policy beliefs of economists, e\cn dowm to the present day For 
free trade is but one dramatic exemplification of pure competition, 
and in this field formal attempts at proof were made, or we can at 
least in many cases piece together the implicit beliefs of the author 

1 Perhaps the most common reason for believing competition 
to be optimal stemmed from the recognition that no party could be 
hurt by exchange as compared to his position before trade, since 
he could always refuse to trade Thus, trade ts better than no trade, 
exchange is mutually beneficial, one parly does not gain uhat the other 
loses If we examine the argument carefully, we find that it does 
not really imply that pure competition is optimal, even though 
properly interpreted it can provide a case against prohibitive tariffs 

2 A second more sophisticated argument, which includes the 
first and more rests upon the fact that the equilibrium position 
reached in pure competition represents an optimum for each indi 
\ idual consistent w ith his original endowment of commodities and 
the market situation with which he is confronted But ever} mdi 
v idual may be making the best of a situation without implying 
that that best is very good or is optimal, although each individual 
in pure competition takes price as given, for the market it is a 
variable, and it is quite possible that conditions other than pure 
competition might lead to better results m terms of any of the 
usual ethical notions But, leaving aside all ethical notions, is it 
not equally clear that under (saj ) monopoly , both bujer and seller 
are doing the best that they can for themselves under existing 
market situations 1 * The only distinguishing feature of pure com 
petition, as compared to any other mode of behavior, is that the 
market conditions facing each individual are taken (by him) to be 

straight lines” inv olving trade at unchanging price ratios And 
it is precisely the question of the sense in which this is optimal 
which is left unanswered 

It does not appear that Walras ev er reached beyond this second 

ge of the argument * His cardinal failing consists not so much 

* Compare the very penetrating remarks of Wicksell on this point K WicfcseU 

jres on Political Economy (Englsh translation New York Macmillan, 1934) 
I 72-83 
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in the fact that he jumps from incomplete premises to sweeping 
conclusions, but in the fact that he is satisfied with this very limited 
kind of an optimum, which by a play on words he seems to confuse 
with the more usual and important senses in which perfect compe- 
tition is conceived to be optimal. 3 

3. Still a third stage of reasoning attempts to show, not that 
each individual is made best off by competition since this is im- 
possible unless each can take all, but that in some sense the sum 
total of satisfactions is maximized, that perfect competition effects 
an ideal compromise of mutual benefit, or in its most nebulous 
form that free trade (perfect competition) maximizes world (all 
individuals’) income. Of course, this involves the notion of adding 
the utilities of different individuals, of somehow being able to com- 
pare and weight the utilities of different individuals. Although 
the marginal utility economists, with the exception of Jevons who 
made an interesting slip in connection with the concept of the 
utility of the “trading body,” knew that it was not necessary to 
make interpersonal comparisons of utility in describing exchange 
under pure competition, they nevertheless did not have the modem 
reticence about making such assumptions. 

Launhardt seems to have been the only economist who at- 
tempted to give rigorous proof of this theorem. As Wicksell has 
pointed out in the section just cited, his argument is mathematically 
and logically false. Yet he must be given credit for having made 
an attempt at rigor, and we can learn more from his unambiguous 
failure than from many pages of fuzzy literary effusion. 

To many modem economists the difficulty with this third line 
of reasoning lies in the fact that it assumes that the utilities of 
different individuals can be compared, in fact added together. 

3 If interpreted literally, he would seem to imply that each and every person is made 
better off by perfect competition, a conclusion which, as Wicksell observes, goes farther 
than the free traders themselves, "for the latter have not denied that, a restriction of free 
competition might be most advantageous to a small privileged minority.” Ibid., p. 76. 

Actually there is one qualification in Walras’ argument which makes it not so much 
wrong as trivial. He maintains that perfect competition creates a maximum of satis- 
faction, consistent with trading at uniform prices. Waiving the trivial objection that 
under nondiscriminating monopoly trading is also done at uniform prices, I find this 
confusing. Except for positions of multiple equilibria which we may provisionally 
ignore, the equilibrium position under competition is uniquely determined. Instead of 
being the optimum condition under these conditions, it is the only one possible. Thus, 
it is the worst position as well as the best. 
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This they would regard as “unscientific.” But to the preceding 
generation of economists, intcrindividual comparisons of utility 
were made almost without question; to a man like Edgeworth, 
steeped as he was in the Utilitarian tradition, individual utility — 
nay social utility — was as real as his morning jam. And with 
Marshall the apostrophe in consumers’ surplus was always after 
the s. 

Both Marshall and Wicksell objected to what they considered 
to be a prevalent notion that perfect competition leads to the 
maximum of satisfaction Both enter as a minor objection the fact 
that there may be multiple positions of equilibrium; actually this 
is largely irrelevant since each stable equilibrium might be a rela- 
tive maximum as compared to points in its immediate neighbor- 
hood (i« kletnen) even if it were not the maximum maximormn 
But their major objection consists in the fact that with existing 
distributions of wealth and ability, the processes of imputation 
under competition will give rise to great inequalities in the personal 
distribution of income so that unless individuals are very different 
in their natures the marginal utilities of income will not be equal 
for each individual Both recognize that in these circumstances 
any interference (& la Robin Hood) with perfect competition which 
transfers income from rich to poor would be beneficial. 

4. It might be thought that at this stage Marshall and Wicksell 
would enunciate a fourth proposition, that exchange under perfect 
competition is optimal provided the distribution of income is 
optimal. In the case of Wicksell the proof which he gives ( Lectures , 
p. 80) to show why perfect competition is not optimal when the 
distribution of income is inappropriate paves the way for a proof 
as to why perfect competition is optimal when the distribution of 
income is appropriate. 4 Wicksell also realizes that when the dis- 
tribution of income is not optimal, the creation of a condition of 

4 Actually his proof seems to suffer from one minor draw back. In effect, his evalua- 
tion of the change in utility resulting from a change in price from the competitive level 
assumes that in the noncompetitive situation all individuals are still on their offer curves 
Strictly speaking, this is not possible It would perhaps be correct to say that his proof 
(with slight modifications) shows that transfer of goods or income from one individual 
to another could not improve the competitive conditions There is also an unfortunate 
minor slip in expression, perhaps in translation, in the statement that “free competition 
would secure a maximum satisfaction to all parties to the exchange ” (Ibid , p. 81, my 
' cs ) Actually, it is the sum of all and not the utility oj each that is maximized. 
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imperfect competition, may improve the situation, but that this 
is not the best way of improving the situation, since perfect com- 
petition is a necessary condition to “maximize production.” I 
return to this point a little later in this historical review. 

Although inappropriateness in distribution is thought by Mar- 
shall to render the competitive position suspect, he was of the 
belief that many decisions involve alternatives which affect all 
classes more or less equally. He has been criticized for this too 
facile assumption, but it is nevertheless true that many modern 
economists, and this includes some purists, by use of the principle 
of sufficient reason (or is it insufficient reason?) argue in such terms 
for or against a price change of a commodity which is not presumed 
to relate more to the poor than to the rich. 

However, aside from problems raised by the inappropriateness 
of the distribution of income, Marshall had important objections 
to the equilibrium position realized under perfect competition. 
These objections resulted from his analysis of consumers’ surplus, 
an analysis which was regarded as almost the most significant 
contribution of his Principles. By a comparison of geometrical 
areas he arrived at the conclusion that increasing cost industries 
would be pushed to too great a margin under competition, and that 
the output of decreasing cost industries would be too small under 
competition. From the modern standpoint it is clear that these 
conclusions are true in only a very limited sense. And if Marshall 
did arrive at conclusions which are not completely wrong, it is 
nevertheless clear that he arrived at them for the wrong reasons. 

It is not easy in a few paragraphs to delineate the various faults 
in the Marshallian reasoning. In the first place, his exposition in 
Book V, chapter xiii, is extremely sketchy, and, in the second 
place, it is impossible to avoid the somewhat extraneous difficulties 
arising from the admittedly unsatisfactory treatment of decreasing 
cost by Marshall. However, the latter is simply the most dramatic 
exemplification of the paradox that Marshall, with whom the doc- 
trines of partial equilibrium and industry analysis are inseparably 
associated, nowhere presents a complete or satisfactory theory of 
the industry in its relationship to the firms which make it up. If 
anyone doubts this, he need only compare the treatment of these 
problems by Pigou, Marshall’s shining pupil, in his 1912 Wealth 
and Welfare with his treatment in the late editions of the Economics 
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of Welfare or with that of Vmer in the Zeilsckrtft article cited in 
chapter 111 

Another inadequacy but one that can be easily remedied lies 
in the fact that Marshall neglects producers surplus instead of 
treating this symmetrically with consumers surplus so that it is 
possible by the reasoning of page 468 footnote 2 to arrive at the 
curious conclusion that industries of increasing costs should in 
many cases be contracted c\cn if there are no decreasing cost 
industries to expand 

There seems to be no point in discussing the Marshallian reason 
ing at greater length except to note that Pigou nowhere makes 
essential use of the concept of consumers surplus in his welfare 
analysis He originally enunented essentially the Marshallian 
conditions with respect to increasing and decreasing cost indus 
tries but as a result of the criticisms of All) n Young Knight and 
Robertson he seriously modified these conclusions * In its final 
form his doctrine holds that the equilibrium of a closed economy 
under competition is correct except where there are technological 
external economies or diseconomies Under these conditions since 
each individual s actions have effects on others which he does not 
take into account in making his decision there is a prtma facte 
case for intern ention But this holds only for technological factors 
(smoke nuisance etc ) changes in factor pnccs resulting from the 
expansion of demand by firms in an industr> represent transfers 
which arc irrelevant for determination of ideal output (In all 
fairness it should be admitted that the correct use of consumers 
surplus and producers surplus might have helped to avoid error 
m this regard ) 

There would be no reason to rake oxer these old ashes were it 
not for the fact that Professor Hicks has recently lent the weight 
of his authority to the \ icw that the doctrine of consumers surplus 
has a claim to importance in the w elfare field * As 1 have indicated 
in chapter e n careful' perusaf of fns argument stmpfy confirms my 

* F H kn ght Fallac es in the Interpretation of Soc al Cost Quarterly Journal 
ofEconom cs (1923) Repr nted n The Elk s of Compel t on (XewVork Harper 1935) 
pp 215 236 Allyn Young I „ou a Wealth and Welfare Quarterly Journal of 
Fconomtcs XXVII (1913 ) 672-686 D II Robert on Those Empty Boxes Eco 
noma Journal XXXIV (1924) 16-31 

* J R H cks The Rehab 1 tat on of Consumers Surplus Revtm of Econo nic 
Suites V1H (1941) 108-116 



WELFARE ECONOMICS 


209 

belief that the economist — mathematical, literary, beginner, expert 
— had best dispense with consumers’ surplus. It is a tool which 
can be used only by one who can get along without its use, and 
not by all such. As Hicks admits, it is not useful in the exposition 
of the conditions of “equilibrium” or “optimum.” And even in 
the case of a Crusoe economy where the problems raised by many 
individuals can be sidestepped, it is usually only devised to give 
the loss in utility resulting from a deviation from the optimum in 
the amount of one good. 

In this connection its principal conclusion states that the 
(second order) change in utility resulting from a deviation in the 
amount of one commodity, other commodities continuing to be 
optimally adjusted, depends upon the amount of the discrepancy 
in that good times the discrepancy in the equilibrium condition. 
This conclusion is no more plausibly derived from consumers’ sur- 
plus than from simple intuition. And if one probes deeper, one 
finds in any case that the theorem is incorrect even to the order of 
infinitesimals (second) at which the argument is pitched. 

Thus in the most favorable case to consumers’ surplus where one 
commodity, x n+ i, has literally constant marginal utility so that 

U = L(x 1 , .v 2 , • • • , x„) + mx n+ 1 , (1) 

and where the goods can be converted into each other at constant 
technological rates as indicated by the relation 

71 -{-X 

L biXi = c, (2) 

1 

for this conclusion to be correct it would be necessary that the 
change in utility resulting from a small change in the amount 
of *1 be 

5 2 U = 0 T -§Xii§Xi 2 . (3) 

Actually, by a simple extension of the reasoning in chapter iii, it is 
given by 

8 2 U = O + IEL LijhXihXj. (4) 

x 1 

If we proceed to higher orders of infinitesimals, the case is worse, 
and the same can be said if we drop the unrealistic assumption 
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about the utility of the numeraire and if the original position is 
not one of equilibrium 

Even if consumers surplus did give a cardinal measure of the 
change in utility from a given change it is hard to see what use 
this could serve Only in the contemplation of alternative move 
ments which begin or end in the same point could this cardinal 
measure have any significance and then only because it is an tndi 
cator of ordinal preference Such situations are comparatively 
rare as far as questions of social policy arc concerned being m the 
nature of the somewhat academic question as to whether the intro 
duction of a little monopoly c\ il into one industry all others being 
competitive before and after the change is better or worse than 
introduction of some monopol> in another 

In connection with monopolistic competition the frequent oc 
currcnce of decreasing cost and indivisible initial costs inevitably 
raises problems of an all or none character Waiv ing the diffi 
culties arising from many individuals we sec that correct decisions 
necessitate reference to ordinal indifference curves and to nothing 
else Certain difficulties connected with the determination of the 
optimal amount of differentiation of product w ere properly posed by 
Chamberlin Cassels and Kahn without the use of consumers 
surplus 7 

We may conclude from this lengthy digression that after making 
due allowances for external economies and for certain omissions in 
their expositions the founders of neo classical economics believed 
that perfect competition led to an optimum in exchange and 
production provided that the distribution of income was appro 
pnate But they did not believe that incomes imputed by the 
competitive process as of a given historical distribution of owner 
ship of factors of production and personal abilities was in any sense 
the best one and not subject to modification by appropriate 
mechanisms 

5 Before analyzing the problems encountered under the head 
mgs of optimum production and optimum exchange I should 
like to note briefly the existence of economists who attempted to 

1 E H Chamber! n The Theory of Monopol site Compeltl on (3rd ed Camb dge 
Harvard Un vers ty Press 1938) p 94 J M Cassels Excess Capacity and Monopo- 
1 Stic Competit on Quarterly Journal of Economics LI (1937) 426-443 R F Kahn 
Some Notes on Ideal Output Eeonom c Journal XLV (1935) 1 35 
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establish the stronger position that incomes imputed under compe- 
tition were actually right and best. Thus at an earlier date, 
Bastiat, whose powers of analysis were hardly of the highest order 
even in his day, hoped to show that beneficient competition would 
lead to . . an amount of utility and enjoyment, always 
greater, and more and more equally distributed. ...” 8 

Confronted with the undeniable fact of considerable inequality 
of income and possessing the latent Western European prejudice 
against inequality, writers had either to refer to a future day when 
competition would achieve better results, or attribute existing in- 
equalities to the admittedly large institutional deviations from 
competition, or to look for inequalities among individuals’ charac- 
teristics (including property ownership) to justify differences in 
income. 

To anyone with knowledge of the world the perverse relation- 
ship between exertion and income made necessary a revision of the 
classical real cost doctrine in its simplest form, although the promo- 
tion by Senior of abstinence to the rank of a full-bodied real cost 
helped to bolster that doctrine. But ultimately refuge was found 
in the undeniable fact of differences in personal "ability” and the 
related doctrine of non-competing groups. This raised many ques- 
tions as to the extent to which the relevant abilities were or were 
not “acquired characteristics” and the degree of correspondence 
between the distribution of abilities and income. That much of 
this discussion was meaningless and from most points of view ir- 
relevant does not detract from its significance from the standpoint 
of the history of ideas. 

Among analytical economists J. B. Clark 9 is best known for 
his belief that not only will factors of production have imputed to 
them their marginal productivity under competition, but that this 
is a “natural law” which is “morally justifiable” since this is their 
"actual,” “specific” product. Indeed Clark himself considered 
that the principal way that his independently discovered marginal 
productivity doctrine represented an improvement upon von 
Thunen lay in his demonstration of its ethical fairness as compared 
to the latter’s belief that the doctrine involved exploitation. That 
Clark, who clearly states the distinction between personal and 

8 F. Bastiat, Harmonies of Political Economy (2nd ed.; Edinburgh, 1880), p. 

• J. B. Clark, The Distribution of Wealth (New York, 1899). 
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functional income should lnve thought that he had proved the 
ethical fairness of income determination under competition is 
simply a reflection of the fact that where emotional beliefs m right 
and wrong enter into analysis it is usually not to the advantage of 
the latter As we shall see even if all income resulted from per 
sonal services Clark s proposition is not consistent with widely 
held ethical \ lews and if it is accepted as a definition rather than 
a theorem it will be found to be consistent with no unambiguous 
ethical evaluation of different individuals welfare Nevertheless 
it has considerable appeal especially in a frontier society where 
each individual could be thought of as working by himself under 
conditions where his product could be identified Analytically 
it is almost precisely in these terms that Clark first perceived his 
doctrine going with painful slowness from the (broad) zone of 
indifference to the concept of the internal margin 

6 While Wichsell and Marshall held that competition would be 
optimal if the distribution of income were appropriate it was left 
for Pareto 10 to take the stronger position that competition pro 
duces a maximum d uhltle collectue regardless of the distribution of 
income and indeed even if the utilities of different individuals were 
not considered to be comparable An optimum position in this 
sense was defined by the requirement that there should not exist any 
possible lanation or movement which would make everybody better off 

His discussion is not easy to follow and it has not receded 
attention from economists commensurate with the importance 
which he attached to it Yet it forms the basis of many modern 
notions and it led directly to the important contribution, of Barone 
Pareto also seems to have been one of the first to discuss criteria 
of planning under collectivism 

Pareto s exposition is complicated by the fact that he works 
with differentials or first order (infinitesimal) variations This was 
a very common practice with mathematicians and physicists of 
the nineteenth century and because of its formal heuristic con 
vemences is still often used today And under proper quahfica 
tions this practice can be given a rigorous unambiguous basis 
Nevertheless where delicate problems of interpretation are m 
volved it often obscures more than it reveals especially if the prob 

8 V Pareto Manvtl D Economic P oblique (1909) chap v also the Mathematical 
Append * par 89 patstm 
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lem arises as to whether any given differential expression is an 
“exact” differential. 

Pareto is unwilling to add the utility or ophelimite' of different 
individuals together, either in toto, i.e. (U 1 + U 2 +•••), or for 
small variations (5U 1 + 5U 2 + • • •). For that would involve a 
comparison of different individuals’ utility, and in addition it would 
depend upon the particular cardinal index of ophelimite' selected for 
each. But he was interested in comparing the summed variation 
in the utility of each, after these expressions have been divided by 
the marginal utility of any one good, a, selected as numeraire. 
I H or if we examine the dimensionality of the expression 

w m + 'h iVi+ (5) 

where as usual subscripts represent partial differentiation, but 
where the superscript is taken to indicate different individuals, we 
shall see that this has the dimensionality of the good, a, and 
nothing else. 

Pareto attempts to show that if the original position is one of 
equilibrium under perfect competition, then no possible variations, 
consistent with the fundamental scarcity of goods and given tech- 
nology, can make the above expression positive. If it could, he 
says, it would be possible to arrange things so that each term in 
the expression could be made positive, and then everyone would 
be better off. But the expression cannot be made positive. Ac- 
tually, regarded as a differential expression (of the first order), the 
above expression can be shown to be zero, in consequence of the 
fact that each commodity is sold at minimum unit cost (propor- 
tionality of marginal products, etc., 11 and in consequence of the 
tangency of each individual’s indifference curves to mutual price 
exchange loci. If recourse is had to differentials of higher order, 
the secondary maximum-minimum conditions of firms and indi- 
viduals will guarantee that the expression (5) will be negative for 
all finite deviations from the competitive position, or so Pareto 
attempted to show. 

Although Pareto’s treatment is somewhat sketchy and in need 
of expansion, part of which Barone later provided, the main out- 
lines are reasonably clear. But in connection with the funda- 

11 Ibid., p. 646. 
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mentals of interpretation of the significance of his maximum, there 
do arise certain problems First, can the differential expression of 
(S) be regarded as the exact differential of some expression? Ac 
tually, Pareto later gives a name to this expression, calling it SU, 
but is there an expression U (social utility ? ) of which this is an 
exact differentiaP Pareto does not tell us, but presumably he 
would answer, no, if he were on guard when the question was asked 
As we shall see, Barone does work with an expression whose differ 
ential corresponds to (5), but he clearly recognizes that it is a 
construction, not mxolving the dimension of utility, but rather 
that of the numeraire good 

But the most important objection to Pareto’s exposition is his 
lack of emphasis upon the fact that an optimum point, in his sense 
is not a unique point 13 If transfers of income from one individual 
to another are arbitrarily imposed, there will be a new optimum 
point, and there is absolutely no way of deciding whether the new 
point is better or worse than the old His optimum points const! 
tute a manifold infinity of values This locus can be obtained 
under regimes quite different from perfect competition (eg, by 
multilateral monopoly) Within Pareto's system it is impossible 
to decide, by his differential criterion or otherwise, which of two 
points on what may be called the “generalized contract locus” are 
better, or even that a given movement off the contract locus and 
hence to a non optimal point is good or bad Actually in terms 
of the wider reference schemes of ordinary economic thought such 
a movement may be deemed eminently desirable But Pareto 
shows that however desirable such a move may be, there exists still 
a better move, which for the same (ordinal) amount of harm to 
those who “should” be harmed, will yield more benefit for the 
worthy ones who are to be benefited This is an important 
contribution 

7 la a smsteriy arttcb, arsttca at .JPAS /» J taitaa, but o&t 
translated into English until 1935, ,s Barone developed further and 
m greater detail the Paretian conditions of optimum, especially as 

a In his earlier Cours discussion II 90 ff he explicitly assumes the distribution of 
income to be * convenable but in his later Manuel treatment the dependence of the 
optimum point on the initial distribution of income and hence its lack of uniqueness 
is not brought to the foreground 

u Reprinted as an Appendix in F A Hayek ed Collectivist Economic Planning 
p dge 1935) pp 245-290 
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they relate to the planning of production under collectivism. By 
avoiding all mention of utility and indeed without introducing even 
the notion of indifference curves, Barone was able to break new 
ground along lines which have in recent years become associated 
with the economic theory of index numbers. 

Unlike most of the writers discussed above, Barone is unsatisfied 
with the statement that free competition maximizes product, or 
sums of product, which can then be distributed in any given 
fashion. Heterogeneous products cannot be added. Further- 
more, leisure may be preferred to the maximization of output. It 
is significant that those writers who do not explicitly introduce the 
equations of general equilibrium should gloss over the definition of 
“product” which is allegedly maximized. Thus, Wicksell 14 con- 
fines his demonstration to a case where the same product can be 
provided by different sources, and only in this case shows that 
various marginal conditions are optimal. The same is true of the 
very excellent treatment by Knight, in which the movement of 
goods over alternative roads is analyzed for optimum conditions . 15 

It is remarkable that Professor Pigou, who reaches substantially 
correct conclusions, never squarely meets the problem of the defini- 
tion of social product. His index number discussion represents an 
important contribution in its own right, but it is offered at best as 
an approximate criterion or indicator of changes in individual and 
social welfare. He would not seriously suggest that the thing to 
be maximized is the money value of output deflated by an ideal 
index of prices. Nor will the more exact limits of index number 
theory such as are discussed in chapter vi avail. 

Barone proposes to add different products after they have been 
weighted by their respective prices; it is usually taken to be con- 
venient to express these prices as ratios to the numeraire good, a. 
For Barone, productive services can be treated simply as alge- 
braically negative goods and services. Thus, decisions between 
more or less work can be included in his welfare system. Then if 
the sum total of each good consumed by all individuals together is 
written as 

A = a> + c - 2 + • • • 

B = b 1 + b* + • • • ( 6 ) 

etc., 

M Wicksel!, op. oil., p. 140, passim. 

15 Knight, op. cit., p. 219. 
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and recalling that 

SU — U a to + Ub Sb + (7) 

we can write Pareto s equation (5) above in the equivalent form 

l(to‘ + to 2 + ) + PbiHb 1 + 5b 2 + ) + 

— 5A -f- pi,5B -f* (8) 

In passing from (4) to (8) use is made of the fact that the ratios of 
marginal utilities of two goods for each individual are equal to their 
price ratios Barone himself does not use this terminology but 
no doubt he would ha\e to if the connection with Pareto were to 
be shown 

The expression m (8) can be regarded as the variation in the 
following expression when prices are regarded as constants 

* = A+ p>B+ (9) 

If productive services were knowm to be constant so that they could 
be neglected (9) would equal (except for dynamic factors involving 
capital which we can ignore) money value of national product If 
all productive factors are included it will represent the net differ 
ence between value of consumers goods and the return to produc 
tive services Under many assumptions this quantity must be 
zero under the full conditions of perfect competition 

Barone shows that perfect competition maximizes this expres 
sion prices being taken as fixed parameters i e any variation from 
a condition of price equal to minimum cost must make 6$ as given 
in (8) negative Thus if we are at conditions other than perfect 
competition with (8) not equal to zero for all possible variations 
it w ill be possible to specify a movement which will make 5$ posi 
tive But we can think of 5<J> as being made up of the sum of a 
similar expression referring to each individual 

g<j> = + Sep* + = (to 1 -f p b Sb l -f- ) 

+ (to 2 + + ) + (io) 

If for any movement the total is positive it is not necessary 
that each and every one of the indn idual s Sip be positive but it 
is necessary that those which are positive should outweight those 
which are negative Thus those who are hurt could be compen 
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sated by those who are helped, and there would still be a net gain 
left to be parceled out among the individuals. 

This is essentially the gist of the Barone argument. The one 
point which will occur to the critical reader is the fact that arbi- 
trary prices are assumed in evaluating the expression to be maxim- 
ized. Which prices are to be used? Barone employs the prices 
which prevailed before a contemplated break with the conditions 
of competition are made, and this suffices if one merely wishes to 
demonstrate that not all individuals can be improved by any 
departure from competition. 16 

Unlike Pareto, Barone satisfies himself with deriving optimal 
production conditions without going into the fact that under compe- 
tition no additional individual exchanges of fixed amounts of goods 
would be mutually profitable. No doubt this oversight resulted 
from his wish to avoid the use of indifference curves and utility, 
but even without these constructions, by the use of the index num- 
ber notions which he pioneered, the enlarged conditions of exchange 
could have been included. It is a tribute to this work that a third 
of a century after it was written there is no better statement of the 
problem in the English language to which the attention of students 
may be turned. 

8. The next writer who deserves our attention is A. P. Lerner, 
who comparatively recently developed, presumably independently, 
the Paretian conditions which show that the marginal equivalences 
realized by perfect competition lead to an optimum of production 
and exchange in the special senses discussed above. 17 Actually, in 
the field of production his statement of the problem is slightly 
different from that of Pareto and Barone. They showed that a 
movement to conditions of perfect competition in the field of pro- 
duction and cost could make everyone better off because they could 
be given more of every good. But they still worked with indi- 
viduals. Even in a collectivized state where the existence of the 
individual is not assumed, the Lerner formulation of the sense in 
which output is optimal would still hold : the marginal equivalences 
of competition are such as to give a maximum of any one product for 

15 Actually, Barone discusses varying prices in a passage which seems obscure to me. 
Op. cit., p. 255. 

17 A. P. Lerner, “The Concept of Monopoly and the Measure of Monopoly Power,” 
Review of Economic Studies, I (1934), 157—175. “Economic Theory and Socialist 
Economy,” Review of Economic Studies, II (1934), 51-61. 
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given specified amounts of all others. This is almost identical with 
the Parcto-Baronc production propositions, but not quite. 

Professor Hotelling, also presumably independently, developed 
in two articles ls conditions closely related to the Pareto production 
and exchange conditions of optimum In particular, he has in- 
sisted upon the fact that marginal rather than average costs provide 
the appropriate basis for pricing, and he dc\ eloped the dramatic 
applications of this to the problem of railroad rates and decreasing 
cost public utilities in general On the analytic side each thing 
which he sets out to prove he docs prove with great elegance and 
generality, but his fundamental primitive assumptions arc only 
implicitly related to each other and to the equations of general 
equilibrium. Moreover, his welfare work really falls into two dis- 
tinct headings, on the one hand, that of the first article cited and 
much of the second article, and on the other hand, that of the 
second section of his second article referring to the "fundamental 
theorem” (especially pp 24S-256). Roughly these two diverse 
contributions of Hotelling fall respectively under the headings of 
optimal production and optimal exchange conditions; or, on the 
analytic side, to the difference between firms with unlimited 
budgets and the consumer with limited budget, to each of which 
fields Professor Hotelling has contributed much in the way of 
demand analysis. This dualism explains why so discerning a 
reader as Professor Trisch should have been puzzled by the 
Hotelling proof ** 

'• H Hotelling, "The Gcnenl Welfare m Petition to Problems of Taxation and of 
Railway and Utility Rates,' Economrlnca, \ I (I93S), 242-269, "Cdgeworth's Taxation 
Paradox and the Nature of Demand and Supply Functions," Journal of Political Econ- 
omy, XL (1932), 577-4H6 

'* Space cannot permit a detailed examination of the exact steps m the Hotelling 
reasoning, this being particularly unnecessary since it is clear that his conclusions are 
impeccable In the original specification of his system Professor Hotelling essentially 
generalizes the Dupuit-Marshall partial equilibrium set up to many interrelated indus- 
tries However, unless we confine ourselves to the production problem alone, this will 
not lead to the equations of general equilibrium 1 hese require the addition of the 
special demand functions of consumers for good* and their supply functions of productive 
services In the mixed consumer firm system the intcgrabihty conditions which give 
meaning to Hotelling s line integral, dead loss, and price potenti tl (equal to the Barone 
«t» function) are not satisfied Nor when it came to interpretation would it matter for 
the validity of the Pareto Barone Lemcr conditions if they were While Hotelling 
gives separate consideration to consumers when discussing excises, the two treatments 
•- never adequately integrated 
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9. The last writer to be mentioned is Professor A. Bergson. 20 
He is the first who understands the contributions of all previous 
contributors, and who is able to form a synthesis of them. In 
addition, he is the first to develop explicitly the notion of an ordinal 
social welfare function in terms of which all the various schools of 
thought can be interpreted, and in terms of which they for the first 
time assume significance. In view of his own very generous ac- 
knowledgments of the work of others, even where he himself had 
independently rediscovered many basic theorems in an improved 
form, it is regrettable that his contribution has received so little 
notice. No doubt this stems in part from the mathematical 
character of his exposition, and to the fact that he uses the rather 
difficult notation of differentials throughout. The analysis that 
follows is simply an enlargement and development of his impor- 
tant work. 21 


The Social Welfare Function 

It is fashionable for the modem economist to insist that ethical 
value judgments have no place in scientific analysis. Professor 
Robbins in particular has insisted upon this point, 22 and today it is 
customary to make a distinction between the pure analysis of 

20 A. Bergson, "A Reformulation of Certain Aspects of Welfare Economics,” 
Quarterly Journal of Economics, LII (1938), 310-334. 

21 In recent years Kaldor and Hicks have given an exposition of certain aspects of 
welfare economics. N. Kaldor, “Welfare Propositions in Economics,” Economic Journal, 
XLIX (1939), 549-552. J. R. Hicks, "Foundations of Welfare Economics,” Economic 
Journal, XLIX (1939), 696-712. Mention should also be made of an important article 
which indicates the modification in the Pigouvian analysis necessitated by the considera- 
tions of monopolistic competition. R. F. Kahn, “Some Notes on Ideal Output,” 
Economic Journal, XLV (1935), 1-35. A convenient compact summary of welfare 
economics is provided by O. Lange, “The Foundations of Welfare Economics, Eco- 
nomelrica, X (1942), 215-228. An advance in the discussion is represented by T. 
Scitovsky, "A Note on Welfare Propositions in Economics,” Review of Economic Studies, 
IX (1941), 77-88. Because discussions of Free Trade illuminate the beliefs of economists 
upon these matters, and because this subject provides a convenient illustration, it 
would be desirable to review its literature. However, reference can only be made here 
to the survey in J. Viner, Studies in the Theory 0 } International Trade (New \ ork: Harper, 
1937); to T. Scitovsky, “A Reconsideration of the Theory of Tariffs,” Review of Eco- 
nomic Studies, IX (1942), 89-110; P. A. Samuelson, “Welfare Economics and Inter- 
national Trade,” American Economic Review, XXVIII (1938), 261-266; "The Gains from 
International Trade," Canadian Journal of Economics and Political Science, V (1939), 
195-205. 

22 L. Robbins, An Essay on the Nature and Significance of Economic Science (London, 
1932). 
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Robbins qua economist and Ins propaganda, condemnations, and 
policy recommendations qua citizen In practice, if pushed to ex 
tremes, this somewhat schizophrenic rule becomes difficult to ad 
here to and it leads to rather tedious circumlocutions But in 
essence Robbins is undoubtedly correct Wishful thinking is a 
powerful deterrent of good analysis and description, and ethical 
conclusions cannot be demed in the same waj that scientific 
hypotheses are inferred or verified 

But it is not \ahd to conclude from this that there is no room 
in economics for what goes under the name of "w clfare economics ' 
It is a legitimate exercise of economic analysis to examine the con 
sequences of various \ahtc judgments, whether or not they arc 
shared by the theorist just as the study of comparati\c ethics is 
itself a science like an> other branch of anthropolog> If it is 
appropriate for the economist to ana!>zc the way Robinson Crusoe 
directs production so as to maximize his (curious) preferences, the 
economist docs not thercb> commit himself to those tastes or in 
quire concerning the manner in which thc> were or ought to have 
been formed No more docs the astronomer who enunciates the 
principle that the paths of planets are such as to minimize certain 
integrals care whether or not these should be minimized neither 
for all we know do the stars care Our alxnc historical review 
should show that there is meat) and weighty content to the field 
of welfare economics without invoking new methods m economic 
thought In sa>ing this I do not mean to imply that the field of 
welfare economics lias scientific content because a number of its 
theorems do not require inter personal comparisons of util it) , this 
after all is a mere detail That part which docs involve inter 
personal comp insons of utility also has re il content and interest 
for the scientific analjst even though the scientist does not con 
sider it any part of Ins task to deduce or venf) (except on the 
Je\e3) the vihi „ judgments nh&si implications be 
grinds out In the same way, the mathematical theory of prob 
abilit) accepts as a primitive undefined assumption whose v jhdit) 
is not its concern the initial specification of “equally likely” events 
the measure of various * clashes or the collective, and it then 
proceeds to grind out the mathematical implications of these and 
subsidiary hypotheses It is only fair to point out, however, that 
the theorems enunciated under the heading of welfare economics 
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are not meaningful propositions or hypotheses in the technical 
sense. For they represent the deductive implications of assump- 
tions which are not themselves meaningful refutable hypotheses 
about reality. 

Without inquiring into its origins, we take as a starting point 
for our discussion a function of all the economic magnitudes of a 
system which is supposed to characterize some ethical belief— that 
of a benevolent despot, or a complete egotist, or “all men of good 
will,” a misanthrope, the state, race, or group mind, God, etc. Any 
possible opinion is admissible, including my own, although it is best 
in the first instance, in view of human frailty where one’s own 
beliefs are involved, to omit the latter. We only require that the 
belief be such as to admit of an unequivocal answer as to whether 
one configuration of the economic system is “better” or “worse” 
than any other or “indifferent,” and that these relationships are 
transitive; i.e., A better than B, B better than C , implies A better 
than C, etc. The function need only be ordinally defined, and it 
may or may not be convenient to work with (any) one cardinal 
index or indicator. There is no need to assume any particular 
curvature of the loci (in hyper-space) of indifference of this func- 
tion. Utilizing one out of an infinity of possible indicators or 
cardinal indices, we may write this function in the form 

W= W(z h z 2 ,---), (11) 

where the z’s represent all possible variables, many of them non- 
economic in character. 

Between these z’s there will be a number of “technological” 
relations limiting our freedom to vary the s’s independently. Just 
what the content of these technological relations will be depends 
upon the level of abstraction at which the specifier of the value 
judgments wishes to work. If he is an out and out Utopian he may 
wish to ignore various institutional relations regardless of their 
empirical importance; indeed, he may go all out and repeal the laws 
of conservation of energy and widen greatly the technological 
productivities of the system. On the other hand, he may wish to 
take as fixed and immutable all social and economic institutions 
except those relating to the Central Bank. (Indeed those of fatal- 
istic temperament may regard the restraints to be so numerous as 
to leave no problem of choice.) In other words, the auxiliary 
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constraints imposed upon the \ ambles arc not thcmscKes the 
proper subject matter of welfare economics but must In taken as 
givens 

Subject to these construnts which may l>c written m the most 
general way as 

«(** ) “ 0 ( 12 ) 
tlicrc will presumably l>e in upper bound to If (even though no 
unique value of the :s need correspond to this nmirmim level) 
If certain assumptions of rcgu!ant> are m uh it would l>c possible 
to tndicatc formal conditions for the maximum involving l.a 
grange an multipliers matrices rink and qua Intic forms definite 
under constraint etc However there is no particular point m 
dev eloping this formalism 

The subject cou! I tn I with these iMnalities were it not for the 
fact tint numerous individuals hnd it of interest to 'pcciahrc 
the form of IT the nature of the variables z ind the nature of 
the constraints 

(1) I or one thing prices are not usually included! m the welfare 
function itself except vtr> indirect!) thnmgb the effects of differ 
ent prices and wages upon the qu intitas of consumption work etc 

(2) Also cert un of the v inaliles cm lx. thought of as referring 
to a particular indiv tdunl or family c g one of the z s ma> lie the 
amount of tea consumctl tiy John Jones or the amount of unskilled 
labor which he provides 

(3) It is often further issuuicd that the quantities of a given 
commodity consumed by one individual arc of the same type as 
those consmneel by another technically this means that certain 
of the \ arnbles enter in the technical side*conditions in sums w Inch 
relate the total amount produced of a commodity to input regard 
less of the ultimate distribution of that output Whatever may be 
said about the admissibility of this the matter is still worst when 
a similar assumption is made concerning the homogeneity of the 
\ anous scrv ices prov idcd by different indiv iduals Howcv cr even 
if m the most rigorous sense each individual s talents arc unique 
society rarely has the time or patience to learn to appreciate the 
flavor of each man and in the absence of perfect screening of 
different individuals it does treat them as if they were perfectly 
substitutable thus for our purposes they may be assumed to be so 
n many cases This docs not mean that vve work wath a single 
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grade of labor; on the contrary, the number of grades may be very 
large and the classification minute, but it is assumed that there 
are many individuals in each grade, actually or potentially. 

(4) Not infrequently it is assumed that a given grade of pro- 
ductive sendee may be used indifferently in a number of uses. 
Technically this means that certain of the z’s enter into the welfare 
function only as certain sums. As Bergson has pointed out, this 
involves implicit value judgments, so that Robbins when he dis- 
cusses the problem of allocating resources as between alternative 
uses so as to maximize (in some sense) output or personal utility 
is not able even on the Robinson Crusoe level to avoid these no- 
tions; or rather, if resources are not assumed indifferent between 
at least two uses, few interesting marginal conditions can be 
deduced. 

(5) A more extreme assumption, which stems from the indi- 
vidualist philosophy of modern Western Civilization, states that 
individuals’ preferences are to “count.” If any movement leaves 
an individual on the same indifference curve, then the social welfare 
function is unchanged, and similarly for an increase or decrease. 
Actually, an examination of the principles of jurisprudence, the 
folkways and mores, shows that in its extreme form this assumption 
is rarely seriously proposed. Even “sane” adults are not permitted 
to eat and drink what they think best, individuals cannot sell 
themselves in order to consume more in the present, milk ration 
tickets cannot be exchanged for beer at the will of the owner, etc. 

But economists in the orthodox tradition have tended to con- 
sider the above cases as exceptions. 23 However, in recent years 
many economists, Frank Knight being a notable example, have 

23 Consider, however, the following interesting quotation from Edwin Cannan who 
was very much in the classical tradition: “We shall never decide whether to put a penny 
on beer or to further steepen the super-tax on incomes by considering how much the 
loss of a penny pinches the beer-drinker and the duke: we shall, and we do, decide it 
by making some rough estimate of the aggregate advantage in the long run of the two 
methods to society at large. For example, if we find that cheaper beer means better 
food for underfed children while less super-tax means more training of horses to run fast 
for a short distance with a very light burden, we incline to the super-tax: but if we find 
cheaper beer means more beer for drunkards and less super-tax means more houses for 
the people to inhabit in comfort and health, we incline to the beer tax. From an 
Economic Journal review of Sir Josiah Stamp’s Fundamental Principles of Taxation in 
the Light of Modern Developments, reprinted in An Economist's Protest (London, 1927), 
p. 279. 
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insisted upon the degree to which individual tastes and wants are 
socially conditioned by advertising and custom so that they can 
hardly be said to belong to him in any ultimate sense All this is 
recognized m the witticism of the soap box speaker who said to the 
recalcitrant listener, 'When the revolution comes you will eat 
strawberries and cream, and like it 1 Attention should also be 
1 ailed to the fact that even the classical economist does not literally 
have the individual in mind, so much as the family, of course some 
hardy souls will pursue the will o’ the wisp of sovereignty within 
the family so as to reduce even these collective indifference curves 
to an individualistic basis 

(6) One does not have to be a John Donne then to find fault 
with the above assumption, especially if we consider the closely 
related assumption that an individuals preference depends only 
upon the things which he consumes and not upon what others 
consume As Veblen characteristically pointed out, much of the 
motivation for consumption is related to the fact that others do or 
do not have the same thing Conspicuous expenditure, “keeping 
up with the Joneses," snob appeal, maintenance of face, are un 
portant m any realistic appraisal of consumption habits, and if we 
turn to the field of power analysis, it is not only on the national 
scale that satisfactions" are relative and dog in-the manger tactics 
ire rational 

If this sixth assumption is not made, many of the conclusions 
of welfare economics will remain valid, but they will require modi 
fications to allow for certain external” consumption economies 
not dissimilar analytically to the external technological economies 
ind diseconomies of the Marshall Pigou type 

(7) All of the above assumptions are more or less tacitly ac 
cepted by extremely divergent schools of thought The next as 
sumption involves a more controversial value judgment, but one 
which has been characteristic of much of modem thought of the 
last century, and which is especially typical of the beliefs of the 
classical and neo classical economists It is that the welfare function 
is completely (or very nearly) symmetrical with respect to the con 
sumption of all individuals 

Taken in connection with previous assumptions, m its strict 
form it is not consistent with the patent fact of considerable differ 
ences in individuals' overt preference patterns Thus, m addition 
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to involving a very significant value judgment, it also involves a 
very definite assumption of fact. This was not appreciated by 
economists, who tended to believe in the desirability of an equality 
of income, leaving it to the individual to determine the exact form 
of his consumption. However, it is easy to show that the rule of 
equality of income (measured in dollars, numeraire, abstract pur- 
chasing power) applied to individuals of different tastes, but made 
to hold in all circumstances, is actually inconsistent with any de- 
terminate, definite W function. Equality becomes a fetish or 
shibboleth, albeit a useful one, in that the means becomes the end, 
and the letter of the law takes precedence over the spirit. 

For the decision of equal incomes as optimal in one situation 
implies a certain relative well-being as between vegetarians and 
non-vegetarians ; at different relative prices between vegetables 
and non-vegetables an equal distribution of income can no longer 
be optimal. Actually this does not render invalid the reasoning 
based upon this seventh premise since the adherents of this view 
implicitly held that individuals were very much alike, and if given 
equality of treatment would develop the same want patterns. 
Moreover, they could in all logic take the milder position that a 
great deal less inequality than existed in real life would be desirable 
even if one did not believe in complete equality. 24 

In a similar way, the belief that the individual should rightfully 
receive his imputed productivities is not consistent with a W func- 
tion having properties (1) through (6). A change in the techno- 
logical situation will alter individuals’ fortunes so that the final result 
cannot be optimal if the initial situation was deemed so. Perhaps 
the bourgeois penchant for laissez-faire is the only case on record 
where a substantial number of individuals have made idols of 
partial derivatives, i.e., imputed marginal productivities. 

One could similarly multiply expressions and beliefs which are 
part of everyday parlance, which upon examination turn out to be 
inconsistent and meaningless. The slogan, “the greatest good for 
the greatest number,” was shown by Edgeworth to be one such; 

24 It is not the purport of the above lines that the use of a welfare function leads to 
a belief in inequality as compared to equality. It merely shows that equality of money 
income where there is diversity of tastes involves the equality of nothing important. 
It is in lesser degree like the Anatole France aphorism concerning the equality o t e 
law in its treatment of rich and poor. Before Bergson’s treatment one could * 
sensed, but not analyzed definitively, this subtlety. 
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and we might add the dictum each to count for one and only one 
As Professor Knight has ceaselessly insisted Western Man is a 
hodgepodge of beliefs stemming from diverse and inconsistent 
sources Fortunately his life is sufficiently compartmentalized so 
that he can assume his various roles with a tolerable amount of 
ambiguity m each and only the most introspective worry about 
this enough to become disorganized 

(8) A final essentially unnecessary assumption which was espe 
cially characteristic of the last generation of economists was the 
definition of the welfare function which was to be maximized as the 
sum of cardinal utilities experienced by each individual Before the 
time of Bergson this was not uncommonly met even in the advanced 
literature and vestiges can be found today It stemmed from the 
main Utilitarian stream of economic thought when utility was used 
interchangeably in a behavioristic in a psychological in a physio 
logical in an ethical sense 

It was not uncommon for older writers to ponder over the 
question as to whether utility was being maximized or whether 
pain was being minimized whether by and large man was operating 
in the red below absolute zero but making the best of a bad lot 
The answer depended upon the author s theology and endocrine 
glands at the moment Paley Sidgwick and others could sen 
ously ask whether it was better to have a tremendous population 
each contributing a little to a vast amount of Social Utility or 
whether it was better to have less Social Utility provided its 
average amount per head was maximized 

In the field of Public Finance the assumption of additive mdi 
vidual utilities plus the law of diminishing utility were used to 
justify the imposition of progressive taxes In its most sweeping 
form this doctrine set up minimum aggregate sacrifice or maximum 
total utility as appropriate goals of action This goal can be ob 
tamed only if the marginal utility of income (after taxes) is equal 
for all individuals or if individuals are essentially alike only with 
equal incomes for all 25 On the other hand the criterion that a 
given sum of taxes should be raised so as to lead to equal sacrifice 
for all is a much more conservative doctrine following it we can 

“ F Y Edgeworth The Pure Theory of Taxat on Economic Journal VII 
897) 550-571 
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only be sure that taxes should increase with income, but not 
necessarily in proportion with income . 20 

Today such arguments are not very fashionable since it is as 
easy to assume one’s conclusions with respect to appropriate policy 
as to assume the premises of these arguments. Not only is the 
former more direct, but it is more honest. Nevertheless, some of 
the considerations which enter into the above arguments are latent 
in much of modem discussion and thought. In the distribution 
of war burden the moderately well-to-do tend to point to their 
rather large sacrifices and to call upon the lower classes to share in 
these and make new ones. The relatively poorer farmers and 
laborers concentrate upon how much the rich have left after they 
have made substantial sacrifices, and how little they themselves 
have in any case, not in comparison with what they had before the 
war, but compared to what they consider fair. Whether war 
times are the appropriate times to redress ancient wrongs or not, 
it is too much to expect that the bargaining advantages which the 
war brings will not be used for this purpose. 

In connection with this eighth assumption it was implicitly 
assumed that real income could be treated as a homogeneous quan- 
tity to be distributed among individuals. This could be literally 
true only in a one-commodity world, or in a world where all relative 
prices were fixed constants. Actually prices will vary depending 
how money income is distributed. Strictly speaking, therefore, 
the real judgments embodied in the welfare function must be judg- 
ments concerning a multitude of diverse goods. This would be 

26 The condition of equality of sacrifice is satisfied at each level of income if 
U(X ) — U(X - t) = constant for all X. 

Differentiating this so as to determine the explicit change in t with respect to X, 

dt_ U'(X - 0 - U'(X) 

dX ~ u\x - 0 

Because of diminishing marginal utility this is positive. But if we wish to have pro- 
gressive taxation, the elasticity of income after taxes against income before taxes must e 
less than one. But 

X d{X - t) XU’jX) 

X-t dX ~ {X- t)U’(X - t) ’ 

which is less than one if, and only if, the elasticity of the marginal utility curve is less 
than unity. Thus, for Bernoulli’s law of utility, equality of sacrifice would imply pro- 
portional rather than progressive taxation. . - 
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quite a problem even for a man with definite opinions and great 
preoccupation with value judgments If, however, he takes refuge 
in assumption (5) that individual preferences are to count, the 
individual can be left to decide for himself how he will spend his 
money at given prices. Our ethical observer need only decide 
then what his preferences are as between the given levels of satis- 
faction of different individuals 

It might be thought that our ethical observer, even if the indi- 
\iduals themselves had no unique cardinal indexes of utility, would 
have to find cardinal indicators But this would be quite incorrect 
Of course, if utilities are to be added, one would have to catch hold 
of them first, but there is no need to add utilities * 7 The cardinal 
utilities enter into the W function as independent variables if as- 
sumption (5) is made But the W function is itself only ordinally 
determinable so that there are an infinity of equally good indicators 
of it which can be used. Thus, if one of these is written as 

W = F(IP, IP, ), (13) 

and if we were to change from one set of cardinal indexes of in- 
dividual utility to another set (F 1 , F 1 , ), we should simply 

change the form of the function F so as to leave all social decisions 
invariant Thus, let us move from one configuration of goods 
going to the different individuals to another configuration which 
leaves W unchanged or which is socially indifferent. Then no 
redefinition of the ITs or of F can change this fact the social in- 
difference loci are independent of cardinal numbering And in this 
terminology the significance of assumption (5) that individual 
tastes shall count is contained in the requirement that the social 
indifference slopes between two goods going to the same individual 
are exactly the same as the individual’s indifference ratios As- 
sumption (6) adds that these curves are unaffected by changes in 
the goods going to other individuals 

,T Even if one wished to add utilities, it would seem silly from any ethical viewpoint 
to have one's opinions as to correct taxation influenced by how consumers spend their 
income on goods Yet that is what the recent attempts to measure marginal utility 
must imply if they have any pretensions to relevance for policy R Fnsch, Neur Methods 
of Measuring Marginal Utility (Tilbingen, 1932) I Fisher, “A Statistical Method for 
Measuring ‘Marginal Utility' and Testing the Justice of a Progressive Income Tax,” »" 
Economic Essays Contributed in Honor of John Bales Clark (New York, 1927) 
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Mathematical Analysis 

The assumptions discussed above can be given mathematical 
formulation. Thus, assumptions (2) and (3) imply that the impor- 
tant variables of our system can be regarded as a number of com- 
modities and productive services (X h X 2 , • ■ - ,X n , V h V 2 , • • •, V m ). 
These totals can be distributed among s individuals of the system ; 
the amounts going to each-can be written with a superscript which 
identifies the individual and a subscript identifying the commodity 
or service. (While productive services could be written as negative 
commodities, I have chosen to conform to the more common pro- 
cedure encountered in the literature.) 

Xi = Y, *i k , (i = 1, • • ■, n) 

k ~ l (14) 

Vi = X>/. (j = 1, •••,»») 

k~l 

The social welfare function involves only the amounts going to 
each individual, not prices or totals. Therefore, equation (11) 
is specialized to 


W = W(Xi l 


x ri ^ , , aT, 


yd 




;»i* 


*)• (15) 


This exhausts the implications of the first four assumptions. 

Assumptions (5) and (6) further specialize (11) so that the 
welfare function can be written as 


W = IT[ U^Xi 1 , • • • , Xn 1 ; vi 1 , - • • , vj), 


U’(xc, ■••,»»•)]. ( 16 ) 


where the cardinal forms of the U’s and the W are arbitrary. 
Assumption (7) is meaningless unless the respective U s can be 
made identical; but if this can be done, W must then be a sym- 
metric function of the U’s. Assumption (8) requires that there 
exist a cardinal W and cardinal U’s such that W is a summation of 
the U' s. On the anthropological level this involves (aside from its 
arbitrariness) definite restrictions on the social rates of indifference 
between the commodities and services off the same and different 
individuals. These are similar to the empirical restrictions c. 
independent individual utility as discussed in chapter vii. 
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In the previous section it was noted that the ‘'technical" re 
stramts must be assumed w ith the same arbitrariness as the welfare 
function itself However, since the formulation of general equi 
librium by Walras it has been customary to take as given, by the 
engineer the fundamental relationships between inputs and outputs 
and that production itself takes place in firms or industries which 
are distinct from the individuals having no value m and of them 
selves Under modem industrial conditions this is not unrealistic 
But even here many interesting alternatives can anse What one 
calls economics economic engineering, engineering, etc , is to a 
considerable degree a matter of choice One can assume that all 
production decisions involving relative marginal productivities are 
the concern of the engineer, or economic engineer, and that the 
economist can take as already derived a transformation relationship 
between the X s and V s of the form 

T{XiX if ,Xn,VtV,, , V m )~Q (17) 

This implicit relation is interpreted to give the maximum amount 
of any one output as of given amounts of all inputs and all other 
outputs or the minimum amount of any one input as of given 
amounts of all outputs and all other inputs 

But if one starts from more primitive technological assumptions 
such as the production functions of cacli commodity, the trans 
formation locus is a derived theorem and not an axiom Behind 
it he many interesting optimal production conditions involving 
marginal productivities and other magnitudes usually thought of 
as economic rather than engineering 

Production Conditions 

In a synthesis of welfare economics we can derive first a set of 
production conditions which require for their validity the weakest 
of all ethical assumptions simply that more of any one output, other 
commodities or services being constant, is desirable, similarly, less 
input for the same outputs is desirable 

Under the simplest technological conditions we can take as 
given the production function relating each output to the inputs 
devoted to it 



WELFARE ECONOMICS 


231 

where the first subscript indicates the kind of productive factor 
and the second subscript the commodity to which it is applied. Of 
course, the total applied to all commodities is given by 

Vj = Vjx -j~ Vji + • • • + Vj n . (j = 1, • • •, m) (19) 

Ordinarily it is assumed that the production function possesses 
partial derivatives (marginal productivities), but the above form- 
ulation includes the case of so-called fixed proportions or fixed 
coefficients where the production function has contour lines with 
corners. 

Equations (18) and (19) represent ( n + m) relations. If we 
specify arbitrarily all but one output (service), we can maximize 
(minimize) the remaining one. This is an extremum problem in 
which there are auxiliary constraints. The first order partial de- 
rivative conditions can be expressed directly as proportionalities 
and rank properties of the matrix of first partial derivatives of our 
functions. But it is illuminating to express these directly by 
means of the artifice of Lagrangean undetermined multipliers. To 
do this we set up the function 

$ = IT 1X1 + TV 2X2 -}-••• 

+ Tf n X n + X 1 F 1 + X 2 F 2 + • • • + X m F m (20) 

and pretend to maximize it treating all the variables as if they were 
independent, and treating the tt’s and X’s as (undetermined) con- 
stants. If the correct secondary extremum conditions are written 
out in full, it will be seen that they are quite different from those 
that would have to hold if the $ function were really to be maxim- 
ized. Only by accident would these secondary conditions coincide; 
only by accident would $ not be at a minimum or would it be at 
an extremum rather than simply at a stationary value. 

This may seem to be an esoteric and recondite point of little 
practical significance. It is the primary contention of the present 
work that everything interesting is contained in the inequalities as- 
sociated with an extremum position, rather than in the equalities. 
This is no less true of the field of welfare economics. From a 
deeper point of view market prices, regarded as parameters by perfect 
competitors, are nothing more nor less than Lagrangean multipliers. 
The Langrangean expression $ corresponds to Barone’s <t> function 
or to Hotelling’s price potential and can be regarded as the value 
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of output or national product, expressed in terms of money or any 
numeraire, and generalized by the subtraction of factor costs 
If the game of competition leads to optimal conditions it docs 
so partly by accident For it is precisely under the conditions 
favorable to the maintenance of atomistic competition (briefly 
absence of decreasing costs) that the secondary conditions of the 
correct maximum problem agree with those guaranteeing the max 
lmization of the money value of output as of fixed prices Where 
there are substantial technological increasing returns, competition 
as an empirical phenomenon breaks down It is in such conditions 
that collectivism is likely to be considered seriously as a social 
policy If then a socialist regime should insist upon mechanically 
playing the game of competition with prices regarded as parameters 
a la Lange, 28 its managers would rush away from the minimization 
of $, even though this minimization may be precisely what is 
required under decreasing cost if welfare is to be maximized 

The moral is not that prices should cease to be related to mar- 
ginal cost or that planning under socialism is impossible It is 
simply that the decentralized operators m a planned society should 
refrain from a literal apmg of atomistic, passive parametric price 
beha\ lor Instead of pretending that demand curves are infinitely 
elastic when they are not the correct shape of the curve 16 to be 
taken into account This does not mean that the decentralized 
operators should take account of their influence on price as a 
monopolist would 23 

After the elimination of Lagrangean multipliers the first order 
maximum conditions take the form 

dX t dX , 

3du _ flv m , 

tiXfc - “ ali k 

dvn di mk 

11 0 Lange article in On ike Economic Theory of Socialism B E L pp ncott ed 
(Minneapolis University of Minnesota Press 1938) pp 55-142 

When we come 10 the foil statement of welfare conditions it will be seen that 
unusual d faculties do arise in the decreasing cost case in determining whether a given 
maximum position represents a maxtmum maximorum or whether the number of differ 
1 ated products should be reduced 


(f.,k = i.. 


(21} 
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or the equivalent form 

ax, 

dvji 

aXj “ " 

dVri 

In words this takes the form : productive factors are correctly 
allocated if the marginal productivity of a given factor in one line is 
to the marginal productivity of the same factor in a second line as the 
marginal productivity of any other factor in the first line is to its mar- 
ginal productivity in the second line. The value of the common factor 
of proportionality can be shown to be equal to the marginal cost of the 
first good in terms of the ( displaced amount of the) second good. 30 

Geometrically these conditions can be easily derived in the two 
good, two factor case by means of a Jevons-Edgeworth-Bowley- 
Lerner diagram consisting of a box whose respective sides are 
equal to the available total amounts of the two factors. Any 
point within the box, if oriented with respect to the lower left-hand 
corner, can be thought of as representing the amounts of the two 
factors used in the production of the first good, and the iso-product 
contour lines can be sketched in. The same point when referred 
to the upper right-hand corner represents the allocation to the 
second good, whose contour lines can also be superimposed on the 
same diagram. 

If we specify the amount of one of the goods and restrict move- 
ments to one of these contour lines, the optimum position is reached 
only when we have touched the highest output line of the other 
good, or at a point of tangency of the opposing iso-product lines. 
The geometrical diagram indicates the correct secondary condi- 
tions which are quite different from conventional diminishing re- 
turns. The locus of all such points of tangency, which can appro- 
priately be called a “generalized contract’’ cum, represents the 
infinity of optimum positions. If along this locus we read off the 
amounts of the respective products, and plot one of these magni- 

30 By introducing as variables inputs and outputs of different dates optimal behavior 
over time can be included in the above formulation. However, when this is done, it 
will be seen that contrary to the belief of most economists since the time of Bohm- 
Bawerk, no single real interest rate is implied for a capitalist or socialist state. 
Equality would be a necessity only in the highly unusual case where relative prices of all 
goods remain the same over time. 
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dVjn 

dX n 
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In 
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(j,r = 1, (22) 
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tudes against the other, the resulting locus is the substitution, 
transformation, or opportunity cost curve, T. The slope of this 
curve at any point represents the marginal cost of one of the goods 
in terms of the other, or the ratio of marginal costs expressed in 
terms of some third magnitude. 31 

The substitution curve is drawn up as of given amounts of the 
factors of production and will shift with any changes m the latter. 
For a given transformation curve to be of any relevance, the factors 
of production must be regarded as being indifferent as between 
different uses 32 

If constant returns to scale and only one factor (perhaps equal 
to a composite dose) are assumed, we have the classical case of a 
straight line transformation curve If the first of these assump- 
tions is made, but there is more than one factor of production, 
unless the goods should just happen to use the factors in the same 
proportion, the transformation curve will be concave to the origin 
because of the law of diminishing returns to varying proportions. 
The curve can also assume this shape for other reasons relating to 
returns to scale. In the decreasing cost case, however this arises, 
the curve will be convex to the origin. 

In all of these cases let the firms or planners really take prices 
as given and attempt to maximize value of output or 4». In the 
constant returns case there will be complete specialization on one 
of the goods (the one with comparative advantage) or complete 
indifference at the critical price ratio equal to the cost transforma- 
tion ratio If both goods are consumed, and if the economy is a 
closed one, the latter critical price ratio is sure to be the only 
relevant one. 

In the increasing cost case at given price ratios the firm will 

81 For the geometrical picture see W F Stolper and P A Samuelson, “Protection 
and Real Wages,” Renew of Economic Studies, IX (1941), 58-74 

** It is only when the last two unnecessary and unrealistic assumptions are made 
that the so-called opportunity cost doctnne is even formally valid Even under these 
conditions the usual formulation is rather a mumbo jumbo of high sounding gibberish, 
which does not state the conditions of equilibrium in a very direct way All this is 
further complicated by the fact that most enunciations of the opportunity cost doctnne 
are purely verbal leisure is treated as a displaced good! Inevitably, therefore, when 
the opportunity cost doctnne is carefully stated and qualified, it degenerates into the 
full conditions o! general equilibrium m which factor supply and preference equations 
must be introduced if only as inequalities This is not to imply that one must accept 

dubious psychological language and interpretations of classical real cost theorists 



WELFARE ECONOMICS 


235 

come into equilibrium at the levels of output which make prices 
proportional to marginal (transformation or factor) costs. How- 
ever, in the decreasing cost case the planners in a socialist state at- 
tempting to maximize at fixed prices would concentrate on one or the 
other of the two commodities. But this would be optimal only in 
an open economy which could really trade with the outside world 
(in unlimited amounts) at the given price ratio. In a closed planned 
state it might be desirable to produce something of both, to have 
price ratios equal to marginal cost ratios even if that meant a mini- 
mization of <h. If we regard the passive approach of the decen- 
tralized planners as an equilibrium process, it is clear that the cor- 
rect optimum, being a minimum and not a maximum, would not be 
an equilibrium point at all. Or if the planners acted like Balaam’s 
ass, it could be regarded as a highly unstable equilibrium point 
from which they would rush at the slightest disturbance. 

Thus it is only accidentally — the first two cases — that com- 
petition is optimal. For it is in those cases that the Lagrangean 
expression (20) is at a maximum when we are at the correct maxi- 
mum of (18). In these accidental cases, if we like, we may think 
of our production decisions as being independent of preferences, in 
the sense that they are uniformly best with respect to the most 
extreme weightings of the commodities. Under increasing returns 
we may even never settle on society’s best transformation curve. 

Actually our production conditions might have been introduced 
in more complex form than as single, isolated production functions 
of the form (18). It is to a certain extent arbitrary as to how much 
preliminary maximizing behavior we consider as being done by the 
engineer and how much by the economist. Under competitive 
conditions the relevance of prices to the decision is taken as the 
dividing line ; but for welfare purposes we may dispense with prices 
completely. However, it would seem somewhat awkward then to 
term the problem one of engineering; still some economists 
might prefer to begin with the transformation function, T. 

If we examine production possibilities more closely, it will be 
apparent that in many cases it will not be possible to end up with 
a single transformation function, T. Thus suppose that commodi- 
ties divide up into two or more groups such that no productive 
factor is used in more than one group. Then there will be at 
as many independent transformation curves as there are ■ 
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Of course, formally these can be combined into a single implicit 
equation, e.g., by equating their summed squares to zero, but this 
is trivial. No indeterminacy arises, however, from the multi- 
plicity of transformation curves; we have fewer choices to make 
and therefore fewer marginal conditions. 

Pure Exchange Conditions 

The production conditions of the previous section were deriv- 
able on the basis of rather mild assumptions by virtue of the fact 
that more of all goods, however they are to be divided, would seem 
better than less. These conditions are still only necessary. They 
are not sufficient, since decisions must still be made as to how to 
divide up given available goods, and as to which of the available 
amounts of production will actually be used. 

Even if the last of these two decisions is made, there remains 
the problem of dividing up a given total of all commodities and 
services among individuals. Once there has been specified a clearly 
defined W function, the final optimum point can be easily deter- 
mined. But at this point after assuming, if only implicitly, the 
first six serious assumptions, the modern welfare economist be- 
comes timid and hesitates to make assumption (7) or indeed any 
other specific welfare function. He has been conditioned against 
making ethical assumptions. If possible, therefore, he would like 
to develop further optimum conditions which are within broad 
limits independent of the form of the W specified. In fact, as has 
been known since the time of Pareto at least, it is possible to 
gratify this wish and to specify still another set of necessary, but 
not sufficient, conditions which must prevail as between individuals. 

Here the simple assumption is made that W is undefined, or 
unspecified, except to the extent that it is a monotonic increasing 
function of the individual U’s. We are not able to plot in the 
(U 1 , f/ 2 ) I plane a locus of points along which W is constant. We 
know only that a movement in the northeast direction of the plane 
increases W and is good, a movement in the southwest direction 
decreases W and is bad; a movement into either of the other two 
quadrants taken around a given point is indeterminate. 

In words, the simple assumption is made that only if all indi- 
viduals are made better (worse) off can we definitely state that a 
movement is good (bad). Otherwise we must reserve judg- 
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ment. Mathematically it is clear that for the levels of utility of 
all but one individual being a r bi traril y specified, it is a necessary 
condition of equilibrium that the utility of the remaining individual 
be at a maximum, subject to the condition that there be fixed totals 
of all goods. For 


Z xj = Xi, £ v r i = Vr, 
j=i i = 1 

U j {xF, - • - , x n ] , vF, • • • , vj) = U\ (j ^ k) 
we must maximize 


(23) 
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By using Lagrangean multipliers or by direct methods, it can 
be easily shown that in equilibrium the ratio between marginal 
utilities of two goods consumed by one individual must be the 
same as the ratio of marginal utilities of the same goods to any 
other individual who consumes the same goods. If one or more 
of the goods is absent from the consumption of an individual, cer- 
tain inequalities can be introduced to generalize the above condi- 
tions. We may write the first order conditions of equilibrium in 
the form 


El - HI - = El 

Uf Uj 2 u,*' 


(25) 


It will be noted that only ratios of marginal utilities for the same 
individuals are ever involved . 33 Thus, there is no need for nu- 
merical utility even for a single individual and no need to compare 
the utilities of different individuals. 

Graphically the equilibrium may be represented by the same 
box-type diagram as was mentioned in the last section. The di- 
mensions of the box represent the fixed totals of the commodities 
in existence. Any interior point oriented with respect to the 
lower left-hand comer represents the amount of consumption of 
the first individual, and the contour lines are now to be interpreted 
as indifference curves. Referred to the upper right-hand comer, 
a point represents the second individual’s consumption, and his 
contour lines may be superimposed on the other. Specifying the 

33 Equilibrium conditions regarding the factors of production need not be indicated 
separately if it is remembered that they may be treated as negative commod 
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utility of the second individual, we maximize that of the first by 
moving along the given indifference line of the second individual 
until we reach (at a point of tangency) the highest indifference 
curve of the first Because the original specification of the second 
man's utility was arbitrary, the final equilibrium is also arbitrary 
and not unique The locus of all such arbitrary points is, of 
course, the familiar Edgeworthian contract curve and represents 
the set of points satisfying necessary exchange conditions 

After the discussion of the previous section it will not be neces 
sary to reiterate at length the contention that the equality of the 
\anous ratios can be expressed without Lagrangean multipliers 
and without price ratios, moreover, in by no means impossible 
cases, even playing the game of competition would lead away from 
rather than toward the correct optimal position As before, this 
is because the secondary conditions for a constrained maximum 
are not the same as the secondary conditions necessary literally to 
bring the Lagrangean expression to an extremum 

It will be noted that from any point off the contract curve there 
exists a movement toward it which would be beneficial to both 
individuals This is not the same thing as to say, with Edgeworth, 
that exchange will in fact necessarily cease somewhere on the con 
tract curve, for in many types of bilateral monopoly a final equi- 
librium may be reached of? the contract curve Nor is it the same 
as to say that points on the contract curve are better than points 
off the contract curve Later I shall discuss the correct formula 
tion of the significance of this condition 

But first it is well to write down the full first order conditions 
of equilibrium which must hold if we are to have an optimum of 
both production and exchange, that is, if we combine with the 
conditions of this section the conditions of the last section Then 
(2) ue must have a common marginal rate of indifference between any 
two goods for every individual, this common indifference ratio must, 
moreoier, be equal to the ratio at which one of these goods can be 
transformed into the other in a production sense , the transformation 
to come about as the result of transferring any resource from one 
good s production to the other's ( 2 ) We must have for all individuals 
c common ratio of indifference between supplying more of any factor 
' production and enjoying more consumption of a given good, this 
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common ratio must be equal to the rate at which supplying more of 
that factor results in greater production of the good in question. 
Mathematically, as Lange has shown, 




_ Uxj T Xi ... 
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x i x i 
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(26) 


These conditions may be rewritten in a variety of ways but only 
being (at the first order level) necessary conditions rather than 
sufficient conditions, they are necessarily less in number than the 
number of the unknowns of the system. This is after taking ac- 
count of the fact that the total of factors used in every use and the 
total of commodities consumed must equal the sums of these mag- 
nitudes over all individuals, i.e., even after taking into account 
equations (14). We are still short ( s — 1) equations. These can 
only be supplied on the basis of definite assumptions concerning 
how different individuals enter into the W function. This will be 
discussed in detail in the next section. 

Returning to the combined production and exchange conditions 
of equations (26), I should like to point out various alternative 
formulations of them. First, it is easy to derive from them the 
conditions that the ratios of indifference between any two factors 
must be uniform for all individuals, and equal to the relative 
marginal productivities of the two factors in any line of production. 

Second, it is sometimes convenient to interpret the first of these 
equilibrium conditions as specifying the equality of price and mar- 
ginal utility ratios to the ratios of marginal cost. There is occa- 
sionally some uncertainty in the literature as to whether prices 
should equal marginal costs or just bear the same percentage dis- 
crepancy in every line. The distinction is especially important if 
decreasing costs are very prevalent, for then it is impossible for 
capitalistic monopolies or socialistic bureaus to recover their full 
costs of production if they charge only prices equal to marginal 
costs. Recourse must be had to direct taxation to cover the differ- 
ence or to the imputed revenues accruing to state-owned factors of 
production, revenue which in a world of constant returns to scale 
would be available for distribution to the citizens of the state. 
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So long as only price ratios are used it is not necessary to state 
the units in which costs are measured dollars labor fertilizer or 
opportunity substitution costs In the usual exposition partial 
equilibrium conditions are implicitl> assumed and costs wages 
and prices are expressed in terms of dollars In these conditions 
must prices be equal to marginal costs or just proportional •* The 
answer is equably if we take into account the conditions relating to 
factors of production such as are embodied in the second part of 
equations (26) If all factors of production were indifferent be 
tween different uses and completely fixed in amount (the pure 
Austrian case) then wc could dispense with these conditions and 
proportionality of prices and marginal cost would be sufficient 
But if we drop these high!} special assumptions for which there is 
not m any case empirical or theoretical warrant then if all prices 
were proportional to (say double) marginal costs wc should not 
have an optimum situation B> working a little more or less 
everybody could be made better off since in the described situation 
the preferred terms at which they exchange leisure and goods are 
not equal to the true productivity terms at winch they can be 
transformed into each other 

I need not remind the reader that first order conditions are 
really of secondary importance as compared to the full inequalities 
implied by an optimum position Also it should not be necessary 
to show in detail that the abo\ e conditions hold only if every factor 
of production is actually used m c\cry line of production and if 
something is produced of e\cry good and if all production and 
indifference functions have continuous partial dcrii atives If any 
of these conditions fail the number of equalities may be diminished 
but in every case the correct general inequalities will pre\ ent there 
being any essential ambiguities in the characterization of the 
optimum position A closely related problem arises when there 
are multiple positions of equilibrium With strongly convex m 
difference fields and strongly concave cost functions such cannot 
occur But there is no reason why human nature should display 
the simple regular properties which the observer economist finds 
convenient And it is the essence of the decreasing cost increasing 
returns Iumpiness indivisibility phenomena that improper curva 
ture on the cost side should enter the picture 

Especially where differentiation of product arises as a result of 
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monopolistic competition or as its cause, there arises the problem 
as to whether or not a commodity should be produced at all. If it 
is produced, the marginal cost conditions of (26) should be realized, 
but there may be a better maximum where none is produced. 
Here the extremum position is of the corner type, and the conven- 
tional equalities must be replaced by inequalities. This involves 
decision making at a distance; we cannot feel our way to the op- 
timum, step by step, but must boldly experiment with diverse 
combinations. Where such “all or none” phenomena are con- 
cerned, things often get worse before they get better, and so deci- 
sions im kleinen will not suffice. Even aside from the difficulties of 
the paragraph after next, we cannot decide that a thing should be 
produced if a perfectly discriminating monopolist or government 
bureau could recover its total cost by some sort of a take-it-or- 
leave-it offer. In these cases involving finite decisions we must 
ask consumers (or Robinson Crusoe) whether a given abundance of 
fewer commodities is preferred to an alternative scarcity of a 
greater range of commodities. 34 At this point there will inevitably 
arise questions concerning the rationality of individual choice, 
questions which had been summarily suppressed by a cavalier 
acceptance of the validity of assumptions five and six set out in 
the third section back. But these questions, whatever their im- 
portance, cannot be settled by deductive analysis. 

The equality of price to marginal cost creates confusions in the 
minds of many concerning the question as to whether or not costs 
which are fixed in the short run are not variable in the long run and 
hence should be covered by price. They ask whether price should 
not exceed short run marginal cost since the latter excludes elements 
of long range variable costs. There are two confusions revealed by 
this question. First, marginal cost is sometimes treated as a part 
of total unit costs, which it is not. Marginal cost is the difference 
between costs in two situations and cannot be identified in general 
with given components of cost, labor, materials, etc. Sympto- 
matic of this confusion is the statement that competition tends to 
drive price down to prime or variable costs so that full costs are not 
recovered. Of course, pure competition (which is not really im- 

51 In certain special cases consumer’s surplus may be employed to describe finite 
inequalities. But these cases are rare, and in any case we are rather better off if we use 
direct methods. 
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plied m the previous sentence) involves price equal to marginal 
cost, which may far exceed full unit costs depending upon the scale 
of output and the level of price Not only did Alfred Marshall 
speak loosely on this matter but so great a contribution as J M 
Clark's Economics of Oierhead Costs errs at least in exposition in 
its diagnosis of the role of overhead costs in the breakdown of 
competitive theory An examination of the character of agncul 
ture, the last stronghold of pure competition, would show a high 
level of overhead costs with no tendency tow ard the breakdown of 
competition Aside from non mutable differentiation of product 
the principal cause of disintegration of competition is decreasing 
long run unit costs of each firm at levels representing large fractions 
of total demand for the product, this would be true even if all goods 
could be produced to order 

Marginal cost is not part of cost which must be met, and the 
equality of price and marginal cost has nothing to do with a re 
covery of full costs determination of fair return on investments 
correct imputation of factor shares, etc Its purpose is to secure 
correct factor allocation, and avoid anomalous product allocation 
This is accented if one considers a wavy cost curve or a cost curve 
showing a comer at the point of capacity (long run or short run) 
At the latter point marginal cost is undefined, or, if you prefer, it 
is anywhere from a finite amount to infinity There is no clue as 
to proper price determination, but this is of no consequence so long 
as production does actually take place at the capacity point 
Similarly as demand fluctuates for a commodity whose cost curve 
shows waves because of lumpiness, etc , the correct price charged 
will vary greatly, rather than let a train set out with a single empty 
seat, the price should fall to zero It is no argument against this 
statement that such pricing may not recover full costs or that 
passengers will tend to wait until trams are least crowded What 
could ho to.oxe doyxa.hki socvvlbj vb.av. the evewvw?, cast of traffic? 
Does not a railroad company which gives out passes to its own 
employees often follow the sound principle that they may ride as 
they please so long as they add nothing to costs’ Incidentally, it 
will be seen from this discussion that where small changes in de 
mand create great changes in social cost, posted, administered, 
relatively stable prices are not optimal devices 

The second and less important confusion involved in the state 
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ment that price should exceed short run marginal cost stems from 
the fallacious belief that long run marginal cost “inclusive of 
changes in long run variable factors” is greater than short run 
marginal cost. Actually, we know from the Wong-Viner-Harrod 
envelope theorem that they are equal for instantaneous rates of 
change if production is taking place at the anticipated level. For 
an actual, finite, forward movement from this point, short run 
marginal cost “exclusive of the fixed factors” is of course greater 
not smaller than marginal cost when all factors are varied opti- 
mally. Whether or not output is at the anticipated level, price 
equal to short run marginal cost is necessary in order that the 
existing plant be used optimally; the relationship between price 
and long run marginal cost is relevant to a decision as to whether 
to change the size of plant, when that decision arises. If one 
dropped the first order statement of optimum conditions and re- 
placed them by the general inequalities which must be satisfied, 
it would be obvious that the optimum position must satisfy a 
different maximum condition (inequality) for every alternative 
which can be contemplated. Thus, it must not pay to make a 
small step forward or backward, a middling step, a sizable step, 
a huge step, to shut down completely or open a new line, etc., etc. 
Each of these implies a relationship between price or revenue and 
entirely different measures of marginal or differential cost . 35 

Interpersonal Optimal Conditions 

In the last two sections an attempt was made to derive as 
general conditions as possible with a minimum of controversial 
assumptions. Nevertheless, we have seen that it is not possible to 
deduce a unique equilibrium unless we have more to build upon. 
This is only as it should be, for intuition assures us that there can- 
not be an optimum position which is independent of the exact form 
of the W function. Even if all of the necessary conditions of 
production and exchange are satisfied, we are short as many equa- 
tions as there are individuals in excess of unity. In a Robinson 
Crusoe world where there is but one individual ( pace Friday !) there 

56 Once this fact is mastered, the whole question of “allocation of perfectly joint 
costs” is seen to be a false and irrelevant problem for either a firm or society. It derives 
its present day empirical importance from the intrusion, not necessarily irrationally, of 
“full cost” considerations into price formation, and to problems of government regula- 
tion, e.g., T.V.A., Tariff Commission, etc. 
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are no individuals in excess of unity and the equilibrium is unique 
But as soon as we have more than one individual our previous 
conditions simply assure us that we are on the generalized contract 
locus from which there are no possible movements which are 
advantageous to each and every individual 

An infinity of such positions exists ranging from a situation in 
which all of the advantage is enjoyed by one individual through 
some sort of compromise position to one in which another mdi 
vidual has all the advantage Without a well defined W function 
i e without assumptions concerning interpersonal comparisons of 
utility it is impossible to decide which of these points is best In 
terms of a given set of ethical notions which define a Welfare 
function the best point on the generalized contract locus can be 
determined and only then 

All of this may be formulated mathematically as follows The 
optimal conditions of production and exchange as given m equa 
tions (26) which summarize the argument of the previous two sec 
tions permit us to reduce the level of indeterminacy of the system 
to an implicit equation between the levels of wellbeing of the 
different individuals in the system This may be written as 

P{U l IP U‘) = 0 (27) 

This says that we may specify at will the levels of well being of 
all but one of the individuals and the last one s well being is 
uniquely determined The essential shape of this Possibility func 
lion depends of course upon the technological and other assumed 
restraints of the system as well as upon the tastes of the different 
individuals Notationally the same implicit function can be 
written in many different ways but the locus in question is mvari 
ant under these purely terminological changes One should note 
however that since the exact numerical form of each U is arbitrary 
no importance attaches to the curvature properties of this locus 
If we have a given definite welfare function then we are to 
maximize 

W = W(JP U‘) (28) 

subject to the above constraint The first order condition of 
equilibrium takes the form 
W_ _ P_ 

W, ~ P, 


(*7 = 1 s ) 


(29) 
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These conditions, like the correct secondary conditions which need 
not be written explicitly, are independent of purely notational 
ambiguities involved in the welfare function, the possibility func- 
tion, and the selection of particular cardinal indices of individual 
utility. 

Equations (29) give us the missing (s — 1) conditions of equi- 
librium, and our equilibrium is seen to be finally determinate. It 
is not at all necessary to take the expositional path followed above. 
Thus, Professor Pigou, who does not hesitate from the very be- 
ginning to make inter-individual comparisons of utility, moves 
directly via a maximization of (11) subject to (12) to the same equi- 
librium which Pareto-Barone-Lemer reach after they have ad- 
mitted into their systems interpersonal comparisons of utility. 
And as will be discussed at greater length below, they end up with 
no optimum and no final definite conclusions if they are unwilling 
to permit others to introduce such ethical considerations. 

The real significance of their analysis lies in the fact that they 
provide a relatively easy opening for the introduction of ethical 
notions. For it follows from their analysis that all of the action 
necessary to achieve a given ethical desideratum may take the form 
of lump-sum taxes or bounties. These may be given in the form of 
abstract purchasing power or in kind, it being necessary, however, 
that the latter should be freely exchangeable against other goods. 
Thus, instead of having to decide how much to allocate of each and 
every good to each individual, the ethical authority need only 
decide on the allocation of final incomes between individuals. 

Working directly from a well-defined (ordinal) welfare function, 
we require that W be maximized subject to the physical constraints 
of the system. We may express the final conditions in many 
different forms, with or without Lagrangean multipliers. One such 
compact method which avoids as far as possible duplicating the 
exact form (but not the substance) of the conditions of the previous 
section is given as follows: 


dW dW dW (i= 1 , •••,») 

dxi 1 “ ' ‘ ~ dx .* ~ eXi ’ (J = 1 , * * •, m) 

dW dW dW dW dXi _ _ dW dXn 

dVj 1 dVj s dVj dXidVji dX n dVjn 


( 30 ) 
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Not only are these equations very similar to the famous Note 
XIV which summarizes much of what is best in Marshall s Pnn 
ctples, but they are cast almost exactly in the form followed by 
Professor Pigou in his Economics of Welfare In words, two condi 
tions are implied first, the marginal social utility ( disutility ) of the 
same good ( service ) must be equal for each individual, second, each 
factor of production must be divided among different possible uses so 
that its indirect, derived marginal social utility must be the same in 
every use and equal to its marginal social disutility 

We may leave to the reader the task of showing the necessary 
modifications if the supply of a factor is inelastic, if a factor of 
production is not indifferent between different uses if there is joint 
production, if there are external technological economies or dis 
economies (so that the production function of a good contains m 
it factors of production devoted to other uses), etc By dividing 
the above equations through by any single partial derivative, they 
may be thrown into a form which is independent of the particular 
cardinal representation of W 

While the Lemer production conditions are contained m the 
above set of equations the exchange conditions are not However, 
if we make the individualistic assumptions five and six above so 
that the welfare function takes on the special form given m equa 
tion (13), then in virtue of the identities 

dW _ dWdU r 

dx; ~ dU'dx; { ) 

the exchange conditions of the previous section are also included 
in these equations Thus the production and exchange conditions 
which constitute the “new welfare economics" are included in the 
old, but are themselves incomplete 

If we substitute the last identities in the fundamental equations 
(30), they can easily be thrown into the production and exchange 
conditions given in (26) plus the new interpersonal element guaran 
teeing us that the distribution of incomes and lump sum allowances 
is optimal This takes the mathematical form of equality of 
marginal social utility of income (expressed in terms of any good) 
for all individuals 

dW 


dW dW 
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The statement that any "individualistic” ethical optimum can 
be effected by lump-sum taxes is a theorem, not an axiom. For its 
proof we require only that the individual indifference curves be such 
that they can be swung into final equilibrium by operating upon 
their incomes, letting them trade mechanically at fixed prices. 
The vast importance of this theorem for social policy in a planned 
or free enterprise economy is deserving of the greatest emphasis. 
Yet its greatest significance is in the realm of administration and 
tactics. From the standpoint of the logic of welfare analysis it is 
not fundamental. In fact it is not universally true. 

First, the indifference curves of one or more individuals may 
have curvature properties such that the individual who believes 
or pretends to believe that prices are parameters beyond his control 
will rush away from rather than toward the correct optimum posi- 
tion. The remarks in the section above dealing with production 
which discuss the analogy between prices and Lagrangean multi- 
pliers and secondary conditions may be referred to here. 

Second, and of more practical importance, the optimum position 
may be reached by means of discriminating all or none offers, 
sliding scales, etc., rather than by trading at uniform prices. The 
important thing is the realization of appropriate marginal condi- 
tions rather than what happens to the intra-marginal units. (Com- 
pare on this point the discussion above of recovery of full cost as 
a goal of pricing.) In a regime such as this, lump-sum allowances 
may not suffice to realize the optimum point. 

Third, it is not really easy to devise in practice a tax or subsidy 
which is of a purely lump-sum character. A tax upon income 
affects marginal decisions with respect to effort and risk-taking. 
This is obvious. Less conspicuous is the fact that the usual ethical 
notions compel us to set our allowances according to a man’s 
"circumstances,” which are partially the result of his own actions 
and decisions. Analytically, the problem resembles that of deter- 
mining fair piece rates or of setting a fair "handicap” for golfers 
of different ability. We wish to equalize opportunity for all con- 
testants, but we do not want them to hold back from playing their 
best because of a fear of losing their favorable handicap. Ideally, 
the social managers would have to know the potentialities of every 
individual; and to remedy the distortions of imperfect lump-sum 
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taxes they would have to set a system of quotas and penalties 
based upon potentialities rather than upon performance 

Thus, we might decide that e\eryone should have at least 1 
minimum income, that Society will make up the deficiency betweer 
what the less fortunate can earn and this minimum Once this 11 
realized by those who fall below the minimum, there is no longer 
an incentive for them to work at the margin at least in pecumar> 
material terms This is clearly bad social policy, not because I 
have a vulgar prejudice in favor of work and against leisure On 
the contrary, the increases m real income m the years ahead prob 
ably will be spent sn considerable degree on leisure It is wrong 
because it forces the rest of society to give up leisure Its fault lies 
in the fact that the individual’s allowance is not a lump sum one 
It varies inversely with his effort, and thus penalizes effort It 
would be tolerable only if a small percentage of the population 
were below the minimum or if we could rely upon new motivations 
in the Utopia of the future I, myself, put considerable stock in 
the possibility of changing conventional patterns of motivation 
However, this will not provide comfort to those w ho wish to utilize 
a parametric pricing system with algebraic lump sum allowances 
since these same considerations undermine the ‘individualistic” 
assumptions upon which their analysis is based 38 

Fourth because decisions need only be made concerning m 
comes, the problem of formulating political slogans and beliefs 
which will command wide approbation is simplified However de 
sirable this may be from a political point of view, it must never be 
forgotten that from a consistent ethical point of view decisions 
should be made concerning the welfare function itself Beliefs con- 
cermng the distribution of income are derivativ e rather than funda 
mental Except in the admittedly unrealistic case where all tastes 
are identical, setting up such beliefs as goals is equivalent to ac 
cepting a "shibboleth” and to embracing an ambiguous undefin 
’• The clash between equity and incentive is not unimportant for the present stage 
of modern capitalism To an important first approximation the adverse effects of Ji gb 
marginal rate9 of taxation stem not from the level of the rates provided they are less 
than 100 per cent but from the curvature of the tax formula whereby gains increase 
fax liability more than equivalent losses reduce tax liability But the essence of pro 
gression in taxation and in income redistribution is curvature The only solution 
l es in pooling risks and in drafting tax legislation so as to produce a maximum of 
veragmg over time If this does not suffice equitv may still be worth considerable 
1a voidable costs 
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able welfare function. Among- other things such a procedure in- 
volves making a shibboleth of the existing distribution of relative 
technological scarcities of goods. (Of course, whether this is a bad 
practice is not in itself part of the content of welfare economics, 
which does not presume to deduce appropriate beliefs. However, 
welfare economics can legitimately point out the implications of 
different ethical propositions.) 

New Versus Old Welfare Economics 

While in a real sense there is only one all-inclusive welfare eco- 
nomics, which reaches its most complete formulation in the writings 
of Bergson, it is possible to distinguish between the New Welfare 
Economics, which involves roughly the contents of the sections on 
production and exchange and which makes no assumptions con- 
cerning interpersonal comparability of utility, and the Old Welfare 
Economics which starts out with such assumptions. Roughly it 
is the distinction between Pareto and Pigou. From our above dis- 
cussion it will be clear that the former is included in the latter, but 
not vice versa. 

Strictly speaking there is no opposition between the two points 
of view. However, it is not uncommon for expositors of the “new” 
set of doctrines to imagine that their results have significance even 
if one is unwilling to make any ethical assumptions. In fact this 
belief is almost a necessity for anyone who has taken seriously 
Robbins’ dicta concerning the inadmissibility of welfare economics 
in the body of economic theory. Once it is apprehended that the 
latter notion is a delusion, the need to believe in the significance of 
the production and exchange conditions divorced from the inter- 
personal conditions will disappear. Nevertheless, it may be de- 
sirable to appraise their significance taken by themselves. 

I pass over as being obvious from our discussion of the first six 
assumptions about the W function that it is not literally true that 
the new welfare economics is devoid of any ethical assumptions. 
Admittedly, however, its assumptions are more general and less 
controversial, and it is for this reason that it gives incomplete 
necessary conditions, whose full significance emerges only after one 
has made interpersonal assumptions. To refuse to take the last 
step renders the first two steps nugatory; like pouring out a glass 
of water and then refusing to drink. To say that the first two 
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conditions should be realized, but that the third is meaningless, is 
like saying that it does not matter whether or not a man has hair, 
so long as it is curly* 

A limited significance remains for the new welfare economics if 
we hold that a welfare function is definable but undefined, and if 
we look for conditions which hold true uniformly for all possible 
definitions It cannot tell us which of any two situations is better, 
but it can occasionally rule out one given situation as being worse 
than another in the sense that everyone is worse off It cannot tell 
us when society really has a choice between two given situations 
Most important of all it cannot tell us that a movement about 
which it can give a determinate answer is better than a movement 
about which it cannot give an answer If we are at a point not on 
the Possibility Function, it assures us that there exists a still better 
point But it cannot assert that a given point on the Possibility 
Function is better than all, or many, points not on the Possi 
bihty locus 

Concretely, the new welfare economics is supposed to be able 
to throw light on such questions as to whether the Com Laws 
should have been repealed This act would help many individuals, 
but hurt landowners It is usually assumed, although not neces 
sanly quite rigorously , that free trade can be expected to lead to a 
new “situation’ in which those who are benefited can afford to 
bribe those who are hurt It is thought that this constitutes a 
“case” for free trade Actually, this leads to no positive guide to 
action For the new welfare economics cannot state definitely 
“The Com Laws should be repealed, and the landowners should be 
compensated ” Occasionally such a statement is implicit in a 
writer’s formulation, but it is clear that this involves an unwar 
ranted assumption as to the adequacy of the status quo, an assump 
tion which the new welfare economics holds to be neither right nor 
wrong, but meaningless 

On the other hand, the new welfare economics cannot make the 
definite statement “The Corn Laws should be repealed, and the 
owners should not be compensated " It can only assert in a nega- 
tive way “ ’Twere better that the Corn Laws be repealed, and 
compensation be paid, «/ necessary” It gi\es no real guide to 
action Nor does our experience with man as a social animal sug- 
wt that one can safely predict, as a factual matter, that “educated 
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and intelligent men of goodwill” in point of fact tend to move to 
the generalized contract locus. As an empirical statement of fact 
we cannot agree with the assertion of Edgeworth that bilateral 
monopolists must end up somewhere on the contract curve. They 
may end up elsewhere, because one or both is unwilling to discuss 
the possibility of making a mutually favorable movement for fear 
that the discussion may imperil the existing tolerable status quo. 

Scitovsky 37 has attempted to remedy certain deficiencies in the 
usual statement of the new welfare economists by developing a 
double test of the desirability of two situations, in which the dis- 
tribution of income of the new situation is treated symmetrically 
with that of the old situation. While his is an improvement on 
certain earlier statements, it is so primarily in a negative way, to 
the extent that it delimits the scope of the new welfare economics. 
His own positive formulation is also a “ ’twere better” statement, 
and not a positive guide to action. Such statements are not with- 
out value, but they are not at all substitutes for the policy dictates 
which stemmed from the old welfare economics. 

Within the limited scope of the “ ’twere better” statements 
there is an ambiguity which Scitovsky alone seems to have sensed. 
The term “situation” can mean a number of different things. It 
can refer to an actual position reached by every individual prior to 
the repeal of a tariff, and to the new situation which would actually 
be attained by each individual after new action is taken, i.e., repeal 
with varying or no amounts of compensation. Or it can mean the 
totality of all possible positions available with the Com Laws un- 
repealed, and the totality of all possible positions with the Com 
Laws repealed. The latter is the more significant sense, and can 
be given significance in terms of the Possibility Function developed 
above. Thus, a perfectly legitimate “ ’twere better” statement 
can be made as follows: “Technological change can make everybody 
better off in the sense that it will shift the Possibility Function out- 
wards.” 38 We cannot deduce from this the dictum: “Techno- 
logical change is a good thing,” since the introduction of techno- 
logical change will in fact mean a vector movement from the old 

” T. Scitovsky, "A Note on Welfare Propositions in Economics,” and “A Recon- 
sideration of the Theory of Tariffs,” Review of Economic Studies, IX (1941-42), 77-88 
and 89-1 10. 

M In effect Scitovsky recognizes the case where the Possibility Function is given a 
twist rather than everywhere being shifted outward. 
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positions of every individual to their new positions which can 
hardly bode good to all 

At this point the new welfare economist will fall back on the 
vague statement that ' probably technological change will result 
in good on balance " Not only is this argument on its probability 
side based upon ignorance rather than knowledge, but it is meaning 
less as well unless the admissibility of a W function is admitted 

In closing I should like to point out that the point of view which 
I have been calling the new welfare economics is really only a 
caricature On the whole Pareto, Barone, Hotelling, and Lemer 
sidestepped interpersonal judgments rather than denied them 
Bergson synthesizes the various aspects, as does Lange Whether 
Kaldor and Hicks are open to the criticism along the lines which 
Stigler has recently advanced, 59 the reader must decide 

Conclusion 

In the present chapter I have attempted to give a brief but 
fairly complete survey of the whole field of welfare economics It 
would be possible to develop the subject further in many respects 
and to consider a number of related problems However, limita- 
tions of space preclude this, and I shall content myself with two 
final remarks 

First, what is the best procedure if for some reason a number 
of the optimum conditions are not realized? What shall we do 
about the rennming ones which are in our power ? Shall we argue 
that “two wrongs do not make a right” and attempt to satisfy 
those we can ? Or is it possible that failure of a number of the 
conditions necessitates modifying the rest? Clearly the latter 
alternative is the correct one A given divergence in a subset of 
the optimum conditions necessitates alterations in the remaining 
ones Thus, m a world where almost all industries are producing 
at marginal social cost less than price {either because of mono_poIy 
or external economies) it would not be desirable for the rest to 

** N Kaldor Welfare Propos tions in Economics Economic Journal, XLIX 
(1939) 549-552 J R Hicks Foundations of Welfare Economics Economic Journal 
XLIX (1939) 696 — 712 G J Stigler The New Welfare Economics American Eco 
norntc Review XXXIII (1943) 3S5-359 P A Samuelson Further Commentary on 
Welfare Econom cs A mertcan Economic Review XXXIII (1943) 604-607 See also 
P A Samuelson Welfare Economics and International Trade American Economic 
XXVIII (1938) 261-266 
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produce up to the point where marginal cost equals price. Neither 
would it be quite correct to seek the same percentage or absolute 
divergence from the optimum conditions in each case; although in 
this particular example, if the elasticity of the supplies of the factors 
of production were zero, the proportionality of prices to marginal 
cost would be as good as the exact equality. Still another example 
to show that failure of some conditions necessitates alteration of 
the rest is provided by the possibility of increasing welfare by de- 
liberately selling below marginal costs to groups with a high mar- 
ginal (social) utility of income. Given a faulty distribution of 
income, this can improve the situation, although it would be still 
better to have the full optimal conditions realized. 

The last point consists of the warning that the introduction of 
dynamic conditions into our analysis necessitates a considerable 
change in the statement of optimal conditions . 40 The difference is 
not one of principle, but it is nevertheless important. Judged 
purely on statical grounds, monopolies or a patent system may 
appear as unmitigated evils, and certainly inferior to atomistic 
competition and free trade. But in a dynamical world these judg- 
ments may have to be reversed ; viz. the infant industry argument 
for protection, the stimulus to large scale research which only a 
monopolist can afford, the (alleged) necessity to hold out incentives 
to inventors, etc. Indeed the measure of support which capitalism 
commands is most importantly related to precisely these factors 
of development. 

In admitting the superiority of monopoly to atomistic competi- 
tion in certain respects, we do not imply that it is the best possible 
organization of an industry. There necessarily exists a still better 
third alternative, which may or may not be less Utopian than the 
restoration and maintenance of atomistic competition. 

40 Arguments which allege that individuals, under capitalism or socialism, decide 
what goods they shall consume, but are not capable of making a correct decision with 
respect to saving seem to suggest that there is a qualitative difference introduced by 
dynamics. I refer to the argument that wealthy men are necessary in order to provide 
for saving and capital formation, and that in a socialist state the government should 
decide what the appropriate amounts of capital formation should be. If one specifies 
a W function satisfying the first six assumptions and includes in it as separate variables 
future goods and services, both of these view's are wTong whether there is capitalism or 
socialism. However, especially where problems of the present and future are concerned, 
modem philosophers balk at the assumption that what people think is best for them 
really is. Naturally, when the individualistic assumption six is dropped, our conditions 
are altered. But the same would be true if we denied this assumption in a statical world. 



CHAPTER IX 


THE STABILITY OF EQUILIBRIUM: COMPARATIVE 
STATICS AND DYNAMICS 

Introduction 

It was an achievement of the first magnitude for the older mathe- 
matical economists to have shown that the number of independent 
and consistent economic relations was in a wide variety of cases 
sufficient to determine the equilibrium values of unknown economic 
prices and quantities. Since their life spans were only of finite 
duration, it was natural that they should have stopped short at 
this stage of counting equations and unknowns. It remains to be 
explained, however, why in the first quarter of the twentieth cen- 
tury economists should have been content with what was after all 
only preliminary spade work containing in itself (at least ex- 
plicitly) few meaningful theorems of observational significance such 
as could even ideally be empirically refuted under any conceivable 
circumstances. 

It is the task of comparative statics to show the determination 
of the equilibrium values of given variables (unknowns) under pos- 
tulated conditions (functional relationships) with various data 
(parameters) being specified. Thus, in the simplest case of a 
partial-equilibrium market for a single commodity, the two inde- 
pendent relations of supply and demand, each drawn up with other 
prices and institutional data being taken as given, determine by 
their intersection the equilibrium quantities of the unknown price 
and quantity sold. If no more than this could be said, the econo- 
mist would be truly vulnerable to the gibe that he is only a parrot 
taught to say “supply and demand.” Simply to know that there 
are efficacious “laws” determining equilibrium tells us nothing of 
the character of these laws. In order for the analysis to be useful 
it must provide information concerning the way in which our equi- 
librium quantities will change as a result of changes in the param- 
eters taken as independent data. 
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In the above illustration let us consider tastes as a changing 
parameter influencing only the demand curve Will an increase in 
demand raise or lower price? Clearly the statement that before 
and after the assumed change price is determined by the inter 
section of supply and demand gives us no answer to the problem 
Nothing can be said concerning the movement of the intersection 
point of any two plane curves as one of them shifts And yet most 
economists would argue that in a wide variety of circumstances 
this question can be given a definite answer — namely that price 
will increase 

How is this conclusion derived ? h or few commodities have we 
detailed quantitatu e empirical information concerning the exact 
forms of the supply and demand cur\cs even in the neighborhood 
of the equilibrium point Not only would large amounts of time 
and money be necessary to get such information but in many cases 
it is practically impossible to derive useful emptneal information 
concerning what would happen if virtual changes in price con 
fronted the demanders or the suppliers 

This is a typical problem confronting the economist in the 
absence of precise quantitative data he must infer analytically the 
qualitative direction of movement of a complex system What 
little success he has hitherto achieved can be classified in large part 
under two headings (1) theorems proceeding from the assumption 
of maximizing behavior on the part of firms or individuals and 
(2) stability conditions relating to the interaction between economic 
units Although inadequately explored until comparatively re 
cently the first type of conditions is best know n and w ill not be 
dealt with here except incidentally As will become evident later 
however from certain points of view they can be fitted in as special 
cases of the second set It is the central task of this chapter to 
show how the problem of stability of equilibrium is intimately tied 
up with the problem of deriving: fruitful theorems in comparative 
statics This duality constitutes what I have called the corre 
spondence principle 


Comparative Statics 

The problem may be approached in full generality by consider 
mg n unknown variables (*i * B ) whose equilibrium values are 

v be determined for preassigned values of a parameter a We 
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assume n independent and consistent, continuously differentiable 
implicit relations involving some or all of the unknowns and the 
parameter a; or 


fix 1 , a) = 0. ( 4 = 1, •••,*) 

These determine a set of equilibrium values 1 

x i° = £.(<*)• 

We wish to determine the sign of 


dxi° 

da 


*/(«)• 


( 1 ) 

( 2 ) 

(3) 


Differentiating (1) totally with respect to a, we can express this as 


dx,° 

da 


£ fi a * 



(4) 


where the subscripts indicate partial differentiation, 

/l 1 /2 1 fn' 

A _ /l 2 /** ••• /»* 


l/l" /*" ••• /-"I 

and Aj,- is the cofactor of the element in the jth row and 4th column 
of A. 

Unless some a priori restrictions are placed upon the nature of 
the elements involved in these determinants, no useful theorems 
can be derived. Each unknown derivative depends upon an 
n{n + 1) infinity of possible values. If the various determinants 
were expanded out, a sum of n ! terms would appear in the denomi- 
nator and in the numerator. Regarded simply as chance drawings 
taken at random from a hat, the probability that the signs of these 
would all agree would go rapidly to zero as the number of variables 
increased. Fortunately, as will be shown, the analysis of the 
stability of equilibrium will aid in evaluating these complicated 
expressions. 

1 If for a given value of a = ai, there exists a solution (xf, •••,*„»), and if the 
matrix (j9 /'/<5x/] is of rank n in a neighborhood of (x°), then by the implicit-function 
theorem equations (2) represent single-valued continuously differentiable functions in 
a sufficiently small neighborhood of (oti, x°). 


H/dl, 
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In the simple example of supply and demand alluded to above 
our unknowns are (j>, g), and our equilibrium system can be written 

Iff -£(*>«) =0 (Da >0 D P < 0) 

1 ff - S(p) = 0 W 


where a is a parameter of shift representing taste and D p is 
usually assumed to be less than zero Also 


da 


Da“ 


*£ . 



( 6 ) 

(7) 


Whether or not pnee will increase when demand increases is 
seen to depend upon the algebraic difference be tv een the slopes (re 
ferred to the price axis) of the demand and supply curves at the 
equilibrium point Quantity will increase only if the slope of the 
supply curve is of the same sign as this algebraic difference If the 
system is stable in the sense of Walras it can be shown that the 
supply curve must have a greater algebraic slope than the demand 
curve so that price will necessarily increase, the change in quantity 
is necessarily of ambiguous sign depending upon whether the supply 
curve is positively inclined oi so called backward rising 2 


Stauiuty and Dynamics 

Before deriving explicitly the Walrasian stability conditions re 
ferred to abo\ t I turn to a discussion of the meaning of a stable 
equilibrium This will be found to presuppose a theory of dy 
namics namely a theory which determines the behavior through 
time of all variables from arbitrary initial conditions If we have 
given n variables [*»(/), , x„(/)] and n functional equations of 

’ The distinction suggested bv Mr Kahn between forward falling and backward 
rising negatively incl ned supply curves while suggestive does not rest upon a dynain c 
analysis of the attainment of equil br um and so does not adequately come to gr ps with 
the problem in all its complexity Cf R F Kahn The Elasticity of Substitution and 
the Relative Share of a Factor Review of Economic Studies I (1933) 72 78 also N 
Kaldor A Classifactory Note on the Determinateness of Equ 1 bnum Renew of 
Economic Studies I (1933) 122 136 

The suggest on of Professor Vmer that the latter type of curve gives the maximum 
amount forthcom ng at a given price while the former does not will receive ampl fi cation 
in the course of this argument 
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the general form 

FOiCt), x 2 {t), 


l 

■,x n (r), /] = 0, (* = (8) 


then their behavior is determined once certain initial conditions are 
specified. 3 Examples of functional equation systems are given by 
sets of differential, difference, mixed differential-difference, integral, 
integro-differential, and still more general systems. Following the 
excellent terminology of Professor Frisch, 4 stationary or equilibrium 
values of the variables are given by the set of constants (xj°, • • • , x„°) 
which satisfy these equations identically, or 


t l 

FT*! 0 , x 2 T 


1 

■ , x n °, t] = 0. 5 (t = 1, •■•,») 


( 9 ) 


If the system has always been in equilibrium up until time t°, it 
will subsequently continue in equilibrium. However, the equi- 
librium values (xi°, • • • , x n °) may be attained or even be maintained 
for a finite period of time, and yet because of generalized dynamical 
“inertia” it need not (and in general will not) remain in equilibrium 
subsequently, but may well “overshoot” the mark. 

The equilibrium position possesses perfect stability of the first 
kind if from any initial conditions all the variables approach their 
equilibrium values in the limit as time becomes infinite; i.e., if 

lim Xi(t) = xf, (10) 

/—♦CO 

regardless of the initial conditions. Alternatively, it is sometimes 
stated that an equilibrium is stable if a displacement from equi- 

3 What constitute initial conditions depends upon the nature of the functional equa- 
tions. For differential systems only values of the coordinates, velocities, and higher 
derivatives at some initial time need be specified. For difference equations defined only 
for integral values of t the same is true, where differences replace derivatives. In t c 
general case values of the variables over a continuous time interval, possibly stretc mg 
back to — oo, are required to constitute a complete set of initial conditions. 

4 R. Frisch, “On the Notion of Equilibrium and Disequilibrium,” Review of Economic 
Studies, III (1936), 100-105. 

6 Of course, such a set need not exist. Thus, the simple system 


dx 

dt 


= e 1 — x 


has no stationary equilibrium values since e* — x — 0 has n0 rea ' r00ts ' 
dx/dt = 1 defines no stationary equilibrium position. 


Similarly, 
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hbnum is followed by a return to equilibrium A displacement is 
equivalent to an arbitrary change in the initial conditions and is 
possible only if some of our functional equations are momentarily 
relaxed or if our system is enlarged to include impressed forces 
or shocks 

Stability of the first kind in the small exists if for sufficiently 
small displacements the equilibrium is stable Stability in the 
small is contained within perfect stability but not vice versa A 
system may be stable for small finite displacements but not for 
large displacements Nevertheless, stability in the small is a 
necessary condition for perfect stability and will be analyzed here 
m greatest detail 

It should be pointed out that no conservative dynamical system 
of the type met in theoretical physics possesses stability of the first 
kind If one displaces a frictionless pendulum, it will oscillate 
endlessly around the position of stable equilibrium * Its motion 
is bounded, however and it never remains on one side of the equi- 
librium position for more than a finite time interval Such be- 
havior may be characterized as stability of the second kind or as 
stability in the second sense As before a distinction can be made 
between stability of the second kind in the small and complete 
stability of the second kind For the most part in the present 
investigation I shall be concerned with the problem of stability of 
the first kind 

The equations of comparative statics are then a special case of 
the general dynamic analysis They can indeed be discussed ab- 
stracting completely from dynamical analysis In the history of 
mechanics, the theory of statics was developed before the dy 
namical problem was even formulated But the problem of sta 
bility of equilibrium cannot be discussed except with reference to 
dynamical considerations, however implicit and rudimentary 1 
’We find ourselves confronted with this paradox in order for the 
comparative statics anal> sis to yield fruitful results, w e must first 

* A dynamical system into which friction is introduced via a dissipation function 
may enjoy stability of the first kind On these and kindred matters see G D Birkhoff 
Dynamical Systems (New York 1927) 

7 This is seen to be involved in the virtual work analysis and in the minimum 
potential energy condition characteristic of a stable statical ( stationary ) equilibrium 
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develop a theory of dynamics. 8 This is completely aside from the 
other uses of dynamic analysis as in the studies of fluctuations, 
trends, etc. I turn now to some illustrations of these propositions. 

I. In the literary explanations of the process by which supply 
and demand are equated, the assumption is usually made that if 
at any price demand exceeds supply, price will rise; if supply ex- 
ceeds demand, price will fall. Let us state this more precisely as 
follows: 

V = = H(q D - q s ) = H[_D(p, a) - S(p ) ], (11) 

where H( 0) = 0, and H' > 0. 

In the neighborhood of the equilibrium point this can be ex- 
panded in the form 

P = X(ZV - S v °)(p - p°) + • • -, (12) 

where X = (IT') 0 > 0, and where terms involving higher powers of 
( p — p°) are omitted. The solution of this simple differential 
equation for initial price p at time zero can be written at sight : 

p(t) = p° + (p - p°)e HD ’’- (13) 

If the equilibrium is to be stable, 

lim p(t) = p°. (14) 

t-> CO 

This is possible if, and only if, 

zy - V = 0- ( 15 ) 

If in what follows we rule out neutral equilibrium in the large and 
in the small, the equality sign may be omitted so that 

D p ° - S P ° < 0. (16) 

If the supply curve is positively inclined, this will be realized. If 
it is negatively inclined, it must be less steep (referred to the price 
axis) than the demand curve. If our stability conditions are 
realized, the problem originally proposed is answered. Price must 
rise when demand increases. 

8 The point made here is not to be confused with the commonplace criticism of 
comparative statics that it does not do what it is not aimed to do, namely describe 
the transition paths between equilibria. 
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1 1 These so called Walrasian stability conditions arc not necev 
sarily the only ones * If alternative dynamic models arc postu 
lated completely different conditions arc deduced which in turn 
lead to alternative theorems in comparative statics 

Thus m Marshall s long run theory of normal price the quan 
tity supplied is assumed to adjust itself comparatively slowly If 
demand price exceeds supply price the quantity supplied will 
increase Preserving our notation of equations (5), remembering 
that quantity rather than price is regarded as the independent 
variable and neglecting higher order terms we have the following 
differenti it cqu ition 

’ "*(5? (i>0) (17) 

whose solution is 

g(0 = g° + (3 - 3°)c* (,/0 ' ~ us ' v (13) 


If the equilibrium is to be stable 

1 1 15/- D„° 

Z>/ 5/ D p ° S p * 


< 0 


(19) 


ic the demand curve slope referred to the quantity axis is less 
algebraically than that of the supply curve Since the demand 
curve is negatively inclined 



( 20 ) 


Referring back to equations (7) vve see that Marshallian stability 
conditions require that quantity increases when demand increases 
in every case while the change in price is necessarily ambiguous 
depending upon the algebraic sign of the supply curve s slope 
It is to be pointed out that this forward falling supply curve is 
not a true supply curve in the sense of the amount forthcoming at 
each hypothetical price, although it is a true supply curve in the 
sense of being the locus of price quantity points traced out by 
fluctuations in sufficiently steep demand curves As such it is a 
reversible long run relation 

* A h stor cal error is involved in the identif cat on of the above stab 1 ty cond t ons 
w th Walras in alleged contrast to those of Marshall which are shortly to be d scussed 
Actually as far back as in the Pure Theory of Fore £n Trade Marshall defined stable 
equ 1 br um in which a so-called backward r s ng supply curve was nvotved much 1 he 
Walras an case 
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III. Still another dynamic model may be considered. It has 
been held that for some commodities supply reacts to price only 
after a given time lag, while price adjusts itself almost instantane- 
ously. This leads to the familiar cobweb phenomenon. Using the 
same notation, our dynamic model takes the form of the following 
difference equations, 


q t = D(p t , a). 


( 21 ) 


In the neighborhood of equilibrium 


(«. - ?») = (%) fo- - 5") 

with the solution 

s. - <r + (s - f) {%)'■ 

Stability requires that 



( 22 ) 

(23) 

(24) 


If the supply curve is positively inclined, it must be steeper abso- 
lutely than is the demand curve, reference being made to the 
quantity axis. In this case the approach to equilibrium is of a 
damped oscillatory nature, every other observation being on one 
side of the equilibrium value. 

If the supply curve is negatively inclined, it must be steeper 
referred to the quantity axis than is the demand curve, precisely 
as in the case of Walrasian stability. The approach to equilibrium 
is asymptotic. As in the Walrasian case, we can deduce the 
theorem in comparative statics that price will necessarily increase 
even though the change in quantity is indefinite. 

It is to be noted that a first-order difference equation is richer in 
solution that the corresponding first-order differential equation. 
Not only does it admit of oscillatory solutions, but the stability 
conditions relate to the absolute value of the root of an equation, 
implying two distinct inequalities. Remembering that D p ° is nega- 
tive, the inequality of (24) can be written 

D P ° <S P ° < - D p °. 

The new inequality tells us that any increase in output as a result 
of an increase in demand cannot be so large as the increase in output 
resulting from an “equivalent” increase in supply. 
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IV. Still a fourth dynamical model that has been considered is 
that of Marshall in the Pure Theory of Foreign Trade. Let Figure 2 
represent the familiar offer curves of two trading bodies (suppliers 
and demanders) respectively. Equilibrium is attained at the in- 
tersection (not necessarily unique) of two such curves If equi- 



librium is displaced, country I is to act in such a way as to change 
the amount of x x in the horizontal direction of its offer curve 
(as indicated by the pointed horizontal arrows). Similarly coun- 
try II adjusts x% vertically in the direction of its offer curve. 
Mathematically, 

i, = Ffi[G(.r,) - *i3, (1 c\ 

*i = H t lF( Xl ) - 

where //,' > 0, //,( 0) = 0, and G{x j) — x t - 0, F(x i) — = 0 
represent the statical offer curves of countries I and II respectively. 
For Hi = Hf and proper units, the following system of differential 
equations will hold in the neighborhood of equilibrium, 

*i = - (xi - *i*) + (G'Y(x 2 - xt°), 

= (F') e (X\ - *,•) - (x t - *,•). Kl0) 

The solution takes the form : 

X\(f) * Xi° + knP lt + Aijc x, ‘ i (27) 

Xi{t) = Xi ° + ktiP ' 1 + 

the k's depend upon the initial values (xj, xj) and the X’s are 
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Clearly 


DO 0 


- 1 - X 

(. F'Y 


') 

1 - X 


X = - 1 ± V( G'YiF'y . 


(28) 

(29) 


The equilibrium will be stable if the real part of X is necessarily 
negative, or 

i?(X) < 0. (30) 


If both (G')° and (F')° are of opposite sign (e.g., if one has an elastic 
demand, the other an inelastic demand), this condition will neces- 
sarily be satisfied. The solution will be oscillatory, but damped, 
approaching equilibrium in a spiral as shown above in Figure 3, 
and obeying an equation of the form 


X, = x x ° + e~ l (a,i sin 6t -j- b{ cos Qt). (i = 1,2) (31) 

If both are positive, however (each with elastic demands), then 


and 


V(G , ) 0 (F , )° < 1, 

(32) 

(G')°(F')° < 1, 

(33) 

(fi')’ < (ry 

(34) 


In terms of the slopes of both offer curves referred to the xi axis 

(ID, > (ID.,- (35) 

The equilibrium is approached asymptotically. 

If both curves are negatively inclined, stability requires 


fdx 2 \ ( dxA 
\dxi)i \dxiju 


(36) 


Clearly, the general condition when the curves are of like sign can 


be written 


dx 2 


dx-i 

dx 1 

> 

I 

dx 1 


(37) 


and the approach to equilibrium will in every case be asymptotic. 10 

10 Somewhat paradoxically, in this case positions of stable equilibrium need not be 
separated by positions of unstable equilibrium because of the possibility of complex roots. 
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The stability conditions derived here will be found, if translated 
into terms of supply and demand curves rather than offer curves, 
to imply differing and inconsistent conditions from those of the 
preceding cases. 

V. In the four cases considered I have been concerned with 
problems of stability of the first kind Following a suggestion of 
Dr. Francis Dresch of the University of California, let us suppose 
that price falls not when the instantaneous supply exceeds demand, 
but only when accumulated stocks exceed some normal value, Q°, or 

V = HQ’ - Q) + = M2* - - i *)dt, (! > 0) (38) 

since the stock equals the accumulated difference between the 
amount produced and the amount consumed. Differentiating with 
respect to /, neglecting terms of higher power, and redefining our 
time units so as to suppress the dimensional constant X we would 
have 

f = (D,» - S r ')(p - n (39) 

whose solution is 

pit) = p° + + Cie -«D^s7.)t t ( 40 ) 

the c’s depending upon the initial price and price change. 

Only if 

ZV - *V < 0, (41) 

can explosive behavior be avoided. If the above inequality is 
realized, however, the square root will be a pure imaginary number 
so that the solution takes the form of an undamped harmonic: 

PiP) — bi cos — D p ° t + bt sin 'iS p a — D f ° t + P°- (42) 

Thus, if we require our second-order differential equation to have 
at least stability of the second kind, we come to the same theorems 
in comparative statics as in case I. 

There is at least one serious objection to assuming a nondamped 
system of this kind. If there are superimposed upon our system 
random shocks or errors, these will tend to accumulate so that the 
expected amplitude of the cycles will increase with time. This is 
well illustrated by the familiar Brownian movement of large mole- 
cules under the impact of random collisions. The molecule “takes 
a random walk,” and its mean variance increases with the observa- 
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t.ion time . 11 Before adopting a similar hypothesis in economic 
analysis, some statistical evidence of its possible validity should 
be adduced . 12 

We have now surveyed five different dynamic setups and re- 
lated stability conditions, all referring to a simple one-commodity 
market. Except possibly for cases IV and V, all are mathemati- 
cally trivial. Unaided intuition or simple geometrical methods 
serve to reveal sufficient conditions for stability. They are of sig- 
nificance, however, because each has played an important part in 
the history of economic science ; and precisely because of their sim- 
plicity they provide a useful illustration of the general principle 
involved. In the following sections I shall be concerned with more 
complex problems. 

The Stability of Multiple Markets 

While it might be more elegant at this stage to develop formally 
for general systems the fundamental principles illustrated thus far, 
our foregoing discussion provides a very convenient opening for 
an examination of a problem which has received considerable at- 
tention lately at the hands of Professor Hicks. In Value and 

11 Random shocks are not necessarily to be regarded as a nuisance. In their absence 
friction might imprison the system at some fixed level other than the “true" equilibrium 
level (friction being disregarded). Often random shocks serve to insure the realization 
of average values nearly equal to the equilibrium ones, just as iron filings placed upon 
a piece of paper over a magnet assume the lines of force of the magnetic field when 
gently tapped. 

1! One can avoid an undamped system by assuming that price tends to fall not 
only when stocks are large, but also when current supply exceeds current demand; 
i.e., when stocks tend to accumulate. Then we have 

p = a [ Q° - f o ' (qs - qo)dt J - P(qs - ?z>) 
or 

p = a(ZV - S p °)p + f3(ZV - Sp°)P- («- 0 > °) 

The equilibrium is stable only if 

R(\) < 0, 

or if 

X J -f3(D p ° - S/)\ -a (D p ° - S p °) = 0, 

or if 

ZV - V < 0. 

This agrees with the condition of case I and the one just derived. In fact, each of these 
is a special case when one of the coefficients vanishes. For intermediate values, the 
solutions range continuously between damped harmonic motion and exponential 
approach to equilibrium. 
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Capital , Chapter VIII and Mathematical Appendix, §21, he has at- 
tempted to generalize to any number of markets the stability con- 
ditions of a single market The method of approach is postula- 
tional ; stability conditions are not deduced from a dynamic model 
except implicitly. 13 Propositions which are deduced here as 
theorems are assumed as definitions of stability. 

For a single market, according to Professor Hicks, equilibrium 
is stable if an increase in demand raises price (This rules out in 
the beginning cases II and IV.) For multiple markets equilibrium 
is imperfectly stable if an increase in demand for a single good raises 
its price after all other prices have adjusted themselves; the equi- 
librium is perfectly stable if an increased demand for a good raises 
its price even when any subset of other prices is arbitrarily held 
constant (by means of a relaxation of other equilibrium conditions). 

To test the necessity or sufficiency of these criteria in terms of 
a more fundamental definition of stability of equilibrium let us 
make a natural generalization of the Walrasian conditions of the 
following form : the price of any good w ill fall if its supply exceeds its 
demand, these each being regarded as functions of all other prices. 

Mathematically, 

P. = - Hfq s % - g D ‘) 

= - Hiartpu . p .) - . ?.)] 

= • . (43) 

where 

0 = qs'iPi, ,pn) - qo'ipi, •,/>*) = - qfpu • • ,pn ) (44) 
represent statical equations of supply and demand, a if represents 
the partial derivative of q , with respect to the jth price evaluated 
at the equilibrium set of prices In general, a,f 5 ^ a," u It is 

11 It is true that on page 70 a hint of a dynamical process creeps into the discussion 
The approach to equilibrium seems to be regarded as taking place in finite steps at dis 
Crete intervals of time, 1 e , in accordance with certain difference equations Correctly 
stated, this argument would not lead to essentially different stability conditions from 
iny system of differential equations discussed below, as the later general discussion 
will disclose 

14 If the demand and supply were each drawn up with reference to firms maximizing 
profit, well known integrabihty conditions would guarantee this equivalence On the 
consumer’s side there need be no such equivalence, and if we consider a consumer whose 

1 purchases balance his total sale of productive services, such an equality for every 
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instructive to consider first, however, the symmetrical case (such 
as characterizes markets made up exclusively of entrepreneurs) * 
and let the speeds of adjustments, H/, be set equal to one. The 
solution of equations (43) can be written 


Pi(t) = Pi 0 + £ kijfr', (45) 

i=i 

where (Xj, • • • , X n ) are latent roots of the characteristic equation 
and 



d n ° 

— X &12° * * 

• dif 

/(x) = 

dif 

dif — X 

dif 


d n f 

d n f 

• d n f - X 

- 

a — 

X/j = | dif - X5,y| 

= 0, 


(46) 


the k's depend upon the matrix a and upon the initial conditions. 15 
As before, stability requires i?(Xy) < 0. 

By a well-known theorem of Hermitian matrices, in the sym- 
metrical case all the roots are necessarily real. If the equilibrium 
is to be stable, they must all be negative. According to a classical 
theorem, this is possible if and only if a is the matrix of a negative 
definite quadratic form; i.e., only if all principal minors alternate 
in sign as follows: 


I < 0; 


(Li 


a if 


djf djj 


> 0 ; 


dif dif 
djf djf 
dkf dkf 


dif 

djf 

dkf 


< 0 , 

i t* j k i. (47) 


combination of goods and services would, strictly interpreted, lead to an absurdity; it 
would imply expenditure proportionality and, hence, zero consumption of every good and 
zero offering of every service! For the general demand or supply function we need not 
expect a canceling off of “income effects” since individuals usually face firms in consump- 
tion and factor markets. 

15 If the roots are not distinct, polynomial terms of the form if', Pf 1 , , 
appear where (5 + 1) is the multiplicity of a repeated root. In any case the problem 
of stability depends only upon the X's and is unaffected by such multipliers since the 
exponential always governs the asymptotic behavior of the solution whenever dampening 
does occur. 
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Any ratio of the form 


1 

0 

0 

0 

a » 

d,k 0 

0 

a k ,° 

atk° 


o./ 

a,k° 

«*.• 

a,/ 

&jk a 



flkk° 


m 


is necessarily negative in sign But such ratios are precisely equal 
to the change in the price of the nh good with respect to a unit 
increase in its net supply when appropriate subsets of other prices 
are held constant, so that for this case the stability criteria of Pro 
fessor Hicks are seen to be correct theorems 

Where perfect symmetry is not present (and in business cycle 
analysis it is almost always absent), the Hicks criteria are not at 
all necessary conditions and in many cases not sufficient 14 A 
system may possess stability of the first kind even though neither 
perfectly nor imperfectly stable tn Hicks’s sense I long suspected 
that perfect stability is a sufficient condition for stability of the 
first kind But this conjecture proved to be false Perfect sta 
bihty, like imperfect stability, is neither a necessary nor sufficient 


11 A word of warning may be in order concerning the indiscriminate use of either 
prices or quantities as independent variables Th s leads to contradictory defin tions of 
complementarity in the literary discussion on page 44 and the mathemat cal definitions 
on page 311 the inconsistency between which can lead to opposite signs Such an 
interchange of independent variables (as between the literary definition of stability on 
page 63 and the mathematical conditions on 315 and 325) is particularly important 
where nonsymmetrical matrices are concerned Docs 


imply 


dx 

dp, 

dx, 

dp 


dXt\ 


dp, 

dx. 


> 0 


dp, | 


where 


dx , dx, 
dp, dp 


d ± 

dx 

dpi 


dp 

dx, 

dp 


>0 


dx dx 


? 


Ihe answer is yes but tie proof is not simple Lv<_ \ with symmetry the prod if cl 
? /dx ){dx /dp ) need not be of posit ve sign if more than two i ariables are involved 
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condition. 17 In any case it is too strict a condition, Avhile the re- 
quirement of imperfect stability is not strict enough; only in the 
case of symmetry do these limits converge. Why any system 
should be expected to possess perfect stability, or why an economist 
should be interested in this property is by no means clear. Not 
working with an explicit dynamical model, Professor Hicks prob- 
ably argued by analogy from well-known maximum conditions, 
whereby a maximum must hold for arbitrary displacements and 
through any transformation of variables. As a result, some vari- 
ables may be made constants, and with respect to the remaining 
arbitrary subsets the definiteness of various quadratic forms must 
be insured. On the other hand, in terms of a truly dynamic process 
the equilibrium must be stable for arbitrary initial conditions or 
displacements and for arbitrary nonsingular transformations of 
variables, but not necessarily for arbitrary modifications of the 
dynamic equations of motion such as are involved in the Hicks 
procedure of holding subsets of other prices constant (by violating 
or relaxing true dynamical relations). In principle the Hicks pro- 
cedure is clearly wrong, although in some empirical cases it may be 
useful to make the hypothesis that the equilibrium is stable even 
without the “equilibrating” action of some variable which may be 
arbitrarily held constant. (In connection with the Keynesian 
model later discussed, an example of this is presented.) 

To summarize: for every case true necessary and sufficient 
stability conditions are that R(fj) < 0, where represents the 


17 The matrix 


- £ 10 0 " 

0 € 1 0 

0 0 « 1 

.-1 +1 -1 1 + e - 


for sufficiently small e's has all principal minors positive and yet has some roots whose 
real parts are negative. This shows that even Hicksian perfect stability does not 
guarantee dynamic stability. See P. A. Samuelson, “The Relation between Hicksian 
Stability and True Dynamic Stability,” Economelrica, XII (1944), 256-257. Also, as 
Metzler has recently shown, the Hicksian conditions are necessary (but not sufficient) if 
the system is to be stable for all possible (positive) rates of adjustment in different 
markets; and if all off-diagonal terms are non-negative, the Hicks conditions are both 
necessary and sufficient for stability. L. A. Metzler, “Stability of Multiple Markets. 
The Hicks Conditions,” Economelrica, XIII (1945), 272-292. For a proof that the 
inverse of the last matrix must consist of all elements of the same sign, see J. Mosak, 
General Equilibrium Theory in International Trade (Bloomington, Indiana: Principia 
Press, 1944), p. 49. 
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latent roots of the matrix a This is not equivalent to the Hicks 
conditions 18 

In an address before the Econometric Society Professor Lange 19 
suggested that the speed of reaction might be different in every 
market so that instead of writing 

Pi = H'q, + (49) 

as m (43) we write 

Pi ~ H t g> + (50) 

where H, = H,'( 0) is the (positive) coefficient appropriate to the 
ith market 

Even this is not sufficiently general unless a particular com 
modity classification is to be favored and if our formulation is to 
be invariant under linear transformations of commodities and 
prices We have no choice but to admit that the rate of adjust 
ment in one market may depend on the excess demand m other 
markets, so that m matrix terms 

P « Hq + *= Hap + , (51) 

where q is the column of q,’s a the square matrix a,/ and where H 
need not now be a positive diagonal matrix It would seem 
reasonable at first to require H to be positive definite, or at least to 
be positive quasi definite as the latter expression wms defined in 
chapter vi But if we apply the contragredient transformation c 
as given in equation (47) of chapter vi, 


q ~ Cq, q = c~'q 

p „ c ^p p = c >p 


(52) 


** The following illustrations bear this out The system 
pi = — 2pi -f 4 pi 
Pj = — pi + pi 

possesses stability of the first kind but is neither perfectly nor imperfectly stable 
The system 

pl = pi — Pj 
pi = 2 pi + pi 

is imperfectly stable but departs ever further from equilibrium 

11 0 Lange Abstract of The Stabil ty of Economic Equilibrium Econometrica 
X (1942) 176-177 Price FlexibtUly end Employment (Bloomington Indiana Pnncipia 
Press 1944) appendix. 
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then it can be verified that 

£ = 8q-\ c'Hq + • - • = Hap + • • • = (c'Hac'-')p +■•■. (53) 

It follows that our restrictions cannot be placed upon H alone, 
but only upon the product of H and a; that is, the results depend 
upon the statical matrix a and upon the dynamical reaction-speed 
matrix H. It is sufficient for stability that in some coordinate 
system H~ 1 be positive quasi-definite and a negative quasi-definite, 
but these conditions are by no means necessary. 

Before leaving the problem of stable multiple markets, I should 
like to sketch the effect of the introduction of stocks and its rele- 
vance to stability of the second kind. Let price fall not when cur- 
rent supply exceeds current demand, but when existing stocks (ac- 
cumulated over time from the divergence of current production and 
consumption) exceed an equilibrium amount. Then for proper time 
units and equal speeds of adjustment 

Pi = Qi° - f ( qs - q D )dT = Qi° + f E a»°(Pi ~ P°)dr + ■•■ 

t/0 «/0 /“I 

or 


h = toiffa - Pf) + (54) 

;=1 


whose solution takes the form 

pi{t) = pP+Z + h ije -^‘ ), (55) 

i- J. 

where | - X>5, 7 | = 0, and where for unrepeated roots the k’s 

and the h’s are constant depending upon initial conditions. Clearly 
the motion will be explosive and undamped unless Vxj are all pure 
imaginary numbers; i.e., unless X,- is real and negative. 

If the system is symmetrical, this clearly leads to the same 
conditions as those for stability of the first kind. If not sym- 
metrical, the substitution of second derivatives everywhere for first 
derivatives (through the hypothesis of dependence upon accumu- 
lated stocks rather than instantaneous flows) implies more rigid 
conditions upon the coefficients to insure stability of the second 
kind than were previously required to insure stability of the first 
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kind. This is of course because of the requirement that the roots 
be real as well as negative. 50 

Analysis of the Keynesian System 

Up until now I have considered examples drawn from the field 
of economic theory. The techniques used there are of even more 
fruitful applicability to problems of business cycles. To illustrate 
this I shall analyze in some detail the simple Keynesian model as 
outlined in the General Theory. Various writers, such as Meade, 
Hicks, and Lange, have developed explicitly in mathematical form 
the meaning of the Keynesian system 51 The three fundamental 
relationships stressed by Keynes are (1) the consumption function 
relating consumption (and hence savings-investment) to income, 
and for generality tp the interest rate as well, (2) the marginal 
efficiency of capital relating net investment to the interest rate and 
to the level of income (as of a fixed level of capital equipment, fixed 
for the short period under investigation) ; (3) the schedule of liquidity 
preference relating the existing amount of money to the interest 
rate and the level of income. 

Mathematically, these may be written as follows. 

C(t, Y) — Y + I= — a, (56) 

F(t, Y) — I = — 0, (57) 

L(i, Y) ~ M, (58) 

where t, Y, I stand respectively for the interest rate, income, and 
investment , C, F, L stand respectively for the consumption func- 
tion, the marginal-efficiency-of-capital schedule, and the schedule 
of liquidity preference. M stands for the existing amount of 
money, taken as a parameter; a is a general parameter representing 

10 One could consider the generalization of the intermediate hypothesis of footnote 
12 where price change depends upon stocks and flows, namely, 

ft = Z «ti *[ apt + - />,“)] ( ^ > 0 ) 

If stable for 0 > 0, a = 0, and also for 0 = 0, a > 0, It can perhaps be proved to be 
stable for all intermediate cases 

a J E Meade, "A Simplified Model of Mr Keynes’ System,” Review of Economic 
Studies, IV (1937), 98-107, J R Hicks, “Mr Keynes and the ‘Classics’, A Suggested 
Interpretation,” Economeinca, V (1937), 147-159, Oskar Lange, “The Rate of Interest 
and the Optimum Propensity to Consume,” Economica, V (1938), 12-32 
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an upward shift in the propensity-to-consume schedule; similarly 
as the parameter /3 increases, the marginal-efficiency schedule shifts 
upward. 

We have three relations to determine the three unknowns in 
terms of three parameters, viz. : 


i = i{a, M), 

Y = Y(a, /3, M), (59) 

I = /(«, /3, M). 

As explained in the first section of this part, the usefulness of the 
Keynesian equilibrium system lies in the light it throws upon the 
way our unknowns will change as a result of changes in data. 
More specifically, what are the signs of 


di 

dY 

di 

da ’ 

da ’ 

da ’ 

di 

dY 

di 

~d(f’ 

d/3 ’ 

d /8 ’ 

di 

dY 

iL? 

dM 

dM' 

dM ’ 


Differentiating totally with respect to our parameters and evaluat- 
ing the resulting linear equations, we find 


di _ 

— Ly 

dY _ 

Li 

di FyLi — FiLy 

da 

A ’ 

da 

A ' 

da A 

di 

— Ly 

dY 

Li 

di (1 — Cy)Li + CiL 

d/3 

A ’ 

d/3 

A ’ 

d/S — A 

di 

1 — Cy 

- Fy 

dY 

Fi + Ci 

dM ~ 

A 

> 

dM 

~ A ’ 

di 

Fy(Fi + Ci) + (1 - 

Cy — Fy)Fi 

dM " 



A 

f 


= Ly{Fi + C,) + L,(l - Cr - Fr)- ( 61 ) 


where 

Ci Cr - 1 1 

A = F t F r - 1 
Li Ly 0 

On the basis of a priori, intuitive, empirical experience the 
following assumptions are usually made: 

Cy >0, Fy> 0, Fi <0, Ly > 0, Li < 0, (62) 
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while 

c,l|o 

and is usually assumed m modem discussions to be of minor quan 
titative importance 

In order to evaluate our nine derivatives we must be able to 
determine unambiguously the signs of all numerators as well as the 
common denominator, A A consists of five terms, two of which 
are of positive sign, two of negative sign, and one ambiguous On 
the basis of deductive analysis along stnctly statical lines nothing 
can be inferred concerning its sign Moreover, even if the sign of 
A were determined, all but four of the nine would be found to have 
numerators of indeterminable sign 

This is a typical case If we are to derive useful theorems we 
must clearly proceed to a consideration of a more general dynamic 
system which includes the stationary Keynesian analysis as a spe 
cial case This can be done m a variety of alternative ways I 
shall consider two, the first of which is based upon a differential 
system and yields quite definite results 

Case 1 Let us assume as before that the second and third 
relations of marginal efficiency and liquidity preference work them 
selves out m so short a time that they can be regarded as holding 
instantaneously Let us assume, however, that I now represents 
"intended” investment, and this magnitude equals savings invest 
ment only in equilibrium, i e , when all the variables take on sta 
tionary values If, however, because of some change, consumption 
(say) should suddenly increase, national income not having a 
chance to change, actual savings investment would fall short of 
"intended ' investment because of inventory reduction, etc Con 
sequently, income would tend to rise Similarly an excess of actual 
savings investment over intended investment would tend to make 
income fall Mathematically, this hypothesis may be stated as 
follows the rate of change of income ts proportional to the difference 
between intended samngs-invesiment and actual savings investment 
The discussion here is unrelated to the controversy over the equal 
ity of savings and investment despite possible appearances to the 
contrary The superficial resemblance between my formulation 
and the Robertsonian identities whereby the difference between 
investment and savings is the time difference of income should not 
m,s !ead the careful reader 
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Equations (56), (57), and (58) are replaced by the dynamical 

Y = / - [F - C(i, Y) - a], 

0 = F(i, y) — / + ;S, 

0 = L(i, Y) - M. 

The solution of these is of the form: 

Y = Y° + cie x ‘, 

i = i° + o 2 e x ', 

I = 1° + aze u , 


(63) 

(64) 

(65) 


( 66 ) 


where 


A(X) = 

The equilibrium is stable only if 


Ci Cy — 1 — X 1 
Fi Fy — 1 

Li Ly 0 


But Li < 0; therefore, 
unambiguously. 


-i <0 - 

A < 0 


= A + \L { = 0. (67) 

( 68 ) 

(69) 


This establishes four theorems: an increased marginal efficiency 
of capital will (1) raise interest rates and (2) raise income; an 
increased propensity to consume will (3) raise interest rates and 
(4) raise income. But how will the creation of new money affect 
interest rates? This can be answered by considering more string- 
ent stability conditions. Let us suppose that the interest rate 
were kept constant (say) by appropriate central bank action. 
This assumption is equivalent to dropping the liquidity preference 
equation (65) and treating i as a constant in the remaining equa- 
tions. If the equilibrium is stable under these conditions, we 
must have 


or 


Cy — 1 — X 
Fy 


= 0 = (1 — CV — Fy) + X, 


(70) 


- X = (1 - Fy - Cy) > 0. (71) 

This leads to another important theorem: (5) the marginal pro- 
pensity to consume plus the marginal propensity to invest cannot 
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exceed unity or the system will be unstable (as of a fixed interest 
rate) 22 It also tells us (6) that an increase in the amount of 
money must, ceteris paribus lower interest rates 

We are left w ith four ambiguities of sign Tw o of them depend 
upon the fact that savings may vary in any direction with respect 
to a change in interest rates If we assume that normally savings 
out of a given income increase with the interest rate, or, if they do 
decrease do so not so much as docs investment, then three more 
theorems become true an increase in the amount of money (7) 
increases income and (8) increases investment (9) an increase in 
the marginal efficiency schedule increases investment There re 
mams a final ambiguous term What is the effect upon investment 
of an increased propensity to consume? This is seen to be essen 
tially ambiguous depending upon the quantitative strengths of the 
liquidity preference slopes and the marginal efficiency slopes As 
income increases money becomes tight because of the need for 
financing more transactions This tends to depress investment 
As an offset the increase in income tends to increase investment 
through the marginal propensity to invest Which effect will be 
the stronger cannot be decided on a prion grounds 

I have prepared a 3 X 3 classification indicating the signs of 
the nine terms All but four have definite signs Of these four 
one is essentially ambiguous as indicated by a question mark. The 
remaining three show under question marks their normal pre 
sumptive signs 


Increase in propensity to consume 
Increase in marginal efficiency of capital 
Increase m amount of money 

“ If we take investment also as an independent parameter (say through government 
action) we lose equation (57) and have for stability the condition 
ICr— 1 -X[ ~ (O - 1) -X - 0 
X «= C r - 1 < 0 

or that (he marginal propensity la consume must 6e less than one But this is weaker than 
the previous condition in view of the fact that the marginal propensity to invest is 
assumed pos tive 
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Case 2. I now turn to a system based on a difference equation. 
It is founded upon considerations similar to those underlying the 
Kahn-Clark multiplier block diagrams, and for this reason alone 
is worth consideration. In addition, the analytical contrasts be- 
tween differential and difference systems is brought out. Revers- 
ing the order of the previous exposition, let us take investment as 
an independent parameter and the interest rate as a constant. Let 
consumption be a given function of income during the preceding 
period of time: 

C t = C(i, F ( _ x ) = C(Y^). (72) 

What properties must this function satisfy if the equilibrium is 
to be stable? Income clearly equals consumption plus investment: 

Yt = C t + It. (73) 

Recalling that investment is treated as a constant, I, and using 
the consumption relation, we find 

Yt = C(Yt-i) + /, (74) 

or, to a first approximation, 

(Y t - Y°) = Cy°(Y t -i - Y°), (75) 

where 

Y° = C(Y°) + I (76) 

is the equilibrium level of income for investment equal to I. 

The solution of this difference equation takes the form 

Yt = Y° + K(C y °Y (77) 

and is stable only if 

|cy|<l (78) 

or 

- 1 < C Y ° < 1.” (79) 

While the marginal propensity to consume is usually assumed to 
be positive, it need not be so, and still the equilibrium can be a 

23 This inequality is in effect the formal justification of Keynes’ reply to those 
criticizing his fundamental law, that the burden of proof lay upon them to explain 
why, if their allegations were correct, the economic system was not hopelessly unstable. 
See the passages quoted from a letter of Keynes in E. \V. Gilboy, The Propensity to 
Consume: Reply,” Quarterly Journal of Economics, LI 1 1 (1939), 634. While funda- 
mentally correct, Keynes does overlook the possibility of other stabilizers such as . 1 
marginal propensity to invest, interest rate, etc. 
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stable one. Even if it lies between zero and minus one, it is inter- 
esting to observe that the “multiplier” is positive since 


d.Y° 1 

dl 1 - Cy° 


> 0 , 


( 80 ) 


but less than unity because of negative "secondary” effects. 

Let us now drop the assumption that investment is a datum, 
although keeping the interest rate constant. Our dynamic system 
is of the form 

C(i, Y m ) - Y t + 7, - 0. (81) 

F(i, Y,) - /, = 0, (82) 

and the equilibrium is stable only if 

l>'l = |rr ! 7v| <1 ' <h> 

or 

— 1 1 — F»* | < Cy < 1 1 — Fr } . (84) 

Now if the marginal propensity to invest is less than unity 
(1 — F r > 0), 14 this leads to essentially the same stability condi- 
tions as before, namely, the marginal propensity to consume 
plus the marginal propensity to invest must be less than unity 
(Cy + Fy < 1). But, and this is paradoxical, if the marginal 
propensity to invest is sufficiently large, i.e., greater than +2, the 
marginal propensity to consume may exceed unity, and yet the 
equilibrium will be stable! Moreover, beyond a certain critical 
value the larger the marginal propensity to invest, the more stable 
is the system This comes from neglecting the lag between Y and /. 

If we now consider the system in which none of the variables 
are taken as given, namely, 


C(*„ Y t - 1) - Y, + I, = 0, 

F(t tl Y t ) - I, = 0, (85) 

L(i t , Yi) - M = 0, 

“In the marginal-efficiency relation I have made investment depend upon income, 
which itself includes investment Other writers notably Lange {op at), have made 
it depend only upon consumption The result is indifferent since they can be shown 
to be equivalent If, however, it is assumed that dl/dC > 0, the marginal propensity 
to invest, dl/dY = (dI/dC)/[_l + {dI/dC)2 cannot exceed unity If a one-period lag 
is put in (82), then the sum of the consumption and investment propensities must be 
less than one. 
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stability requires that 


N- 


LjCr 
A + LiCy 


< 1 . 


( 86 ) 


In what may be termed the normal case, where the marginal 
propensity to invest is less than unity, this requires as before that 

A < 0, (87) 

and immediately all the eight determinations of sign of Case I 
become correct. 

In the unusual, but possible, case where 


Fy <0 <Cy< ( Fy - 1) - ~ (F» + C t ), (88) 


the equilibrium will be stable, but the signs of our 3X3 table now 
are as follows: 
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In words, the only theorem which remains true under all circum- 
stances is that an increase in the amount of money must lower 
interest rates if the equilibrium is stable. 

This example illustrates the additional complexities which sys- 
tems based upon difference equations involve. Later some of the 
reasons for this will be explained. 

The examples here adduced serve, I hope, to illustrate the light 
which dynamical analysis sheds upon comparative statics. Prob- 
lems in theory and business cycles of any complexity will almost 
surely require similar analytic treatment if useful and meaningful 
theorems are to be derived. 




CHAPTER X 


THE STABILITY OT EQUILIBRIUM LINEAR 
AND NONLINEAR SYSTEMS 

Introduction 

In the previous chapter it was pointed out that there exists an 
intimate formal dependence between comparative statics and dy 
narmes To my knowledge this had not previously been explicitly 
enunciated in the economic literature and for lack of a better name 
I shall refer to it as the Correspondence Principle It is the purpose 
here to probe more deeply into its analytical character, and also to 
show its two way nature not only can the investigation of the dy 
namic stability of a system yield fruitful theorems in statical 
analysis but also known properties of a (comparative) statical 
system can be utilized to dome information concerning the dy 
namic properties of a system 

An understanding of this principle is all the more important at 
a time when pure economic theory has undergone a revolution of 
thought — from statical to dynamical modes While many earlier 
foreshadowings can be found in the literature, we may date this 
upheaval from the publication of Ragnar Frisch’s Cassel Volume 
essay of only a decade ago 1 The resulting change in outlook can 
be compared to that of the transition from classical to quantum 
mechanics And ]ust as in the field of physics it was well that the 
relationship between the old and the new theories could be in part 
clarified so m our field a similar investigation seems in order 
Before entering however, upon these unavoidably technical 
matters a few obiter dicta concerning the fundamental differences 
between statical and dynamical systems may be in order Con 
ceived broadly a dynamical sj stem might be regarded as any set 
of functional equations which together with initial conditions (in 
the most general sense) determine as solutions certain unknowns in 
functron of time According to this definition timeless statical 

1 Ragnar Frisch Propagation Problems and Impulse Problems in Dynamic Eoc- 
n cs Econom c Essays tn Honor of Cuslav Cassel (London 1933) pp 171-205 
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systems are simply degenerate special cases in which the functional 
equations take on simple forms and determine as solutions func- 
tions of time which are identically constants. We may, however, 
define a dynamical system more narrowly so that it will not be 
regarded as truly dynamic if the functional equations involve only 
variables “of the same instant of time,” containing time, if at all, 
only as a parameter. 2 This excludes the customary statical sys- 
tems of the “historical" as well as the timeless variety. 3 It is 
possible, however, that certain subsets of the solutions of the dy- 
namical equations are defined by equations which are structurally 
identical with those which define a statical system. (Thus the sta- 
tionary solution of a time-sequence analysis, say of the multiplier- 
block-diagram variety, may be determined by a formula exactly 
like that of a timeless, instantaneous system.) This constitutes a 
second possible mutual orientation of statical and dynamical 
systems. 

From still a third point of view a statical system can be looked 
upon as the limiting case of a heavily damped dynamical system. 
Thus any statical equation 

/(*) = 0 

which admits a unique solution x° can be related to a dynamical 
system of the form 

f{x t + A(x< - x °) } = 0. 

This yields directly the equivalent linear difference equation 

x t + A(x< - x°) = x°, 
or 

x t+1 = x°. 

Thus, whatever the initial magnitude of x, at the next “instant” 
the system always takes on its correct statical value. This general- 
izes easily to systems of more than one variable. 

In the following I deal with interrelationships between statics 
and dynamics which largely fall under the second of the three 
headings here discussed. 

2 Ragnar Frisch, “On the Notion of Equilibrium and Disequilibrium,” Review of 
Economic Studies, III, February, 1936, 100-105. 

3 In chapter xi I deal at some length with the distinction between complete causal 
systems and historical or incomplete causal ones, also with the closely related topic • 
the generalization of the notion of stationary equilibrium to systems involving 
explicitly. 
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Functional Equations and Stationary Solutions 
Starting out with n functional equations which constitute a dy- 
namical system involving « unknown functions }xt(/), , x„(*)}, 

but not time explicitly, 

/*{*i(t),*i(t), .*.WI=0 (* - 1. ,n) (1) 

we define a stationary solution (xf, , x„°) as one for which 

P(xX , xl°J = 0 (2) 


These last equations correspond to a set of ordinary statical func- 
tions in n variables (xi, , x«) 


/■(*.. 

where of course 

,*.) = 0 , 

(* = 1 . 

.«) 

(3) 

/’(* i°. 

, X n *) “ 0 

(* “ 1 , 

,n) 

(4) 


The types of functional equations which have been most studied 
are those defined by differential equations, difference equations, 
and integral equations and mixed varieties The first of these 
possesses the most highly developed theory and provides \aluable 
examples of various principles Since economic observations con 
sist essentially of senes defined for integral values of time, the 
second category of difference equations is perhaps of greatest 
interest to the theoretical economist 

The above classes of functional equations have this much in 
common they can all be written as the limit of an infinite set of 
equations m an infinite number of unknowns However, it is not 
customary to wnte a system of differential equations in the form 
(1), but by use of the singular Dirac 6 function defined to make the 
following identity formally true, 

/(*) = J*"° 5(a — x)f(a)da, (5) 

any linear differential equation can be written as an integral equa 
tion Similarly any integral equation of the form 

w+£*w — a)da = 0, 


(6) 
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where B is analytic and k possesses finite moments of all orders, 
can be written as a differential equation of infinite order, namely, 


where 


B{t) + E c<B'(_t) = 0, 

0 


Ci = 


(— l ) 1 f k(a)a'da 
Jo 


i\ 


(V) 

( 8 ) 


Difference equations and mixed types can also be regarded as 
differential equations of infinite order; by the use of the Dirac func- 
tion or by extension of the definition of integration, they can 
equally well be represented as integral equations. In the following 
treatment I shall investigate formal identities, being little con- 
cerned with problems of convergence, and omitting rigorous proofs. 
There is ample precedent and pragmatic justification for this pro- 
cedure in all applied sciences. 

The greatest attention will be paid to systems of differential 
equations and systems of difference equations. Without loss of 
generality these can be written in the following normal form : 

^ = /*'(* 1 , • • • > Xn), (i = 1, • • • , n) (9) 

and 

&Xi{t) = g*[xi(t), • • •, x n {t)), (10) 

or 

Xi{t + 1) = G*{xi(t), • ■ ■ , x„(t) j = Xi + g\ (11) 

If not already in this form they can be so transformed by the intro- 
duction of new variables. 

For the stationary solutions, 

= 0 = /*(*!, • • ;Xn), (* = 1 , • - •, n) (12) 

or 

A Xi = 0 = g*(x 1 , • - •, x n ) = G i — Xi. {i = 1, •••,») (13) 

Systems of Volterra linear integral equations with Poisson 
kernels of the form 

x<(t) = 4>i{t) + £ f Ku(t - e)xj(e)de, 


(14) 
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are also of interest Unfortunately the theory of nonlinear in 
tegral equations is only fragmentary 

Linear and Nonlinear Systems 
Up until now most economists have concerned themselves with 
linear systems not because of any belief that the facts were so 
simple but rather because of the mathematical difficulties involved 
in nonlinear systems This is understandable and excusable since 
all thought advances m small steps Nevertheless from the stand 
point of a study of industrial fluctuations this can be a rather seri 
outs limitation Thus in a linear system the amplitude of fluctua 
tion depends upon the initial displacement no intrinsic amplitude 
— as between full employment and zero employment — is involved 
The attempt to introduce such a fixed amplitude in a linear system 
by the device of determining coefficients so that there will be 
neither dampened nor explosive solutions appears to be misdirected 
As pointed out in the previous chapter page 268 the stochastical 
dispersion of the system increases indefinitely Related to this is 
the fact that this does not yield a unique amplitude but one which 
depends linearly upon initial conditions 

If we insist that a system be linear and that it do not involve 
time explicitly then for differential and difference systems we are 
restricted to the case of constant coefficients This type is mathe 
matically simple and exact solutions are known But a high price 
is paid for this simplicity in terms of special assumptions which 
must be made 

The Nonlinear Differential Equation in one Variable 
Nevertheless I shall show that the problem of stability of equi 
librium if not that of macrodynamic business cycle analysis de 
pends formally in an important way upon the solution of linear 
systems This may be illustrated by a simple differential equation 
m one variable 

= X = }(X) = A, + A,X + A,X' + (IS) 

where / is analytic and expressible as a power series This equation 
presents no difficulties of solution since by a single quadrature it 
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may be solved ; namely, 

' - * - £m - F(X) - 

Let us suppose a “simple” stationary solution X 
so that 


f(x°) = o, 

f{X°) * o. 
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(16) 
X° exists 


(17) 


Then the transformation of variables 


* = X - X° (18) 

transforms (15) into the form 

E f*(X')x* 

± = f(x + X°) = -2 r-j = 0 4- a\x + cox 2 + • • - , (19) 

1 J 


where a constant term is no longer involved, and cq does not vanish. 
Then we can assert : 

Theorem I : A formal solution of the differential equation {19) can 
be written as an infinite power series in the solution of the simple linear 


system 




x — a\X, 

(20) 

or 




x(t) = E c,{gi(a, /)}’ = E Cijae 0 **}*', 

(21) 


1 1 


where a is a constant depending upon initial conditions A 

That this holds formally (i.e., without regard to the question of 
convergence) can be shown by substitution. Thus, 

x = ffli E icigd - £>,-{£; c*gi*T‘. (22) 

1 1 1 


Expanding out and equating coefficients of the same degree in g u 
we find that each c can be determined in succession from the a’s 

< E. Picard, Trails d' Analyse, III, 185; G. D. Birkhoff, Dynamical Systems (New 
York, 1927), chapter iii. 
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and all previous c’s 

Cl 

Ci 


Cn - M(c n - 1, • , Cl) 

Equation (15) before transformation to the form not involving 
a constant term can in a similar fashion be formally solved by an 
infinite power senes m the solution of the equation 

1 « a,, (24) 

or 

* « £ K,go'(t, a) « £ K,\a<t + «}* (25) 

1 1 

But this is only the conventional power-senes solution in t and tells 
us nothing about the stability of the system , every term taken by 
itself goes to infinity regardless of the o’s, although the infinite 
sum need not 

It is well known that such a senes as that in the exponentials 
is convergent for absolute values of a sufficiently small in a time 
interval sufficiently small But if the system possesses first-order 
stability, 5 » e , if Ci <0, then the series will converge for all values 
of l between t 0 and + « since \e a,ll \ is decreasing with time, ap 
proaching zero m the limit Since each term approaches this 
limit, then 

lim x(t, a) = 0 (26) 

Therefore, if the system possesses first order stability, it necessarily 
Possesses stability in the small It is to be emphasized that this is 
an exact snJuf jod sn which no terms ore considered to be of an order 
of smallness and ignorable Likewise, if the system possesses first 
order instability, it must be unstable m the small However, the 
system may be m neutral first order equilibrium, and possess either 
stability or instability m the small In this case Cj vanishes, and 
we must consider the stability of the nonvamshing term of lowest 

*Not to be confused with stability of the first kind a concept employed m the 
Jious chapter 


arbitrary, 

a\ 

(23) 
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degree. This will be done later. We may state our results as 
follows: 

Theorem II: (a) First-order stability is a sufficient condition for 
stability in the small, (b) The absence of first-order instability is a 
necessary condition for stability in the small. 


Example: Logistic Law 

We may illustrate the above principles by considering the 
simplest nonlinear system 

t = A 0 +AiX + A 2 X 2 , (27) 

where 

Ax 2 - 4A 0 A 2 > 0, (28) 

if there are to be any "simple" stationary solutions. 

Without essential loss of generality, we may by linear trans- 
formation of X involving only translation and scale change, and 
by scale change in t, bring the above system into the form 

x = a;(l — #). (29) 

The equation 

x - x 2 = 0 (30) 


has roots 0 and 1, each of which represents a stationary state. The 
above differential equation will be recognized as one which is satis- 
fied by the Verhulst-Pearl-Reed logistic law, according to which 
percentage changes in a variable fall off linearly with the magnitude 
of that variable, approaching a limit asymptotically. The above 
equation, however, is slightly more general since it admits of solu- 
tions which are not 5-shaped. 

By quadrature its general solution is found to be of the form 

* = 1 + Ke~ l ' ^ 


where K is a parameter determined by the initial value of x at time 
zero according to the formula 


K = 


1 

x(0) 


- 1 . 


(32) 


For K = 0 we have the stationary solution 


x(t) s 1, 


(33) 
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and {or K - 00 we have the stationary solution 

*(0 * 0 ( 34 ) 

It can be easily verified that the latter stationary level is un 
stable, while the stationary level of unity is approached asymp. 
totically by all adjacent motions We may classify all possible 
initial conditions as follows 


+ 1 S x(Q) S oo, — 1 < If £ 0, 1 

0 < *(Q) SS 1, 0 %K < + 00 , j 

*(0) * 0, K ~ so, 

- 00 < *(0) <0 —oo < A < — 1, 


hm x{t) ~ I , 
r^O, 

lim x(t) = — oo 


(35) 


Let us now apply our expansion theorem to this problem * 
Expanding around the zero equilibrium point, and determining the 
c coefficients we readily find 

x - ae‘ — *** {cte‘1* H* ■* Y. ( — (36) 


For given values of t and sufficiently small values of a this is readity 
seen to be a convergent senes equal to the geometric senes expan 
sion of 


aC l 1 

i -f ae* ~ f 

1 + - e 1 


(37) 


But for large values of t 

|«£'| > !, 


(38) 


and the senes diverges This confirms our expectation that the 
zero equihbnum level is unstable 
The transformation 

y = x - 1 (39) 

enables us to apply our expansion theorem to the determination of 
the stability of the other equilibrium level Our differential equa- 
tion becomes 

y - - y - y\ (40) 


and in the new variable the equilibrium level equals zero De 


*Cf A J Lotka Elements oj Physical Biology (Baltimore, 1925) pp 64-68 
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termining the c's by substitution as before, we find 
y(t) - - {Ke ~ l } + IKe-‘} 2 - {Ke~ 1 } 3 + [Ke~<}* - ••• 

= £(- 1 Y\Ke-‘V. 

1 

This will be recognized as the formal expansion of 


y = 


1 + Ke~ l L 
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(41) 


(42) 


For small values of K and nonnegative values of t, this converges 
uniformly, and each term goes to zero as time becomes infinite. 
Thus, the equilibrium is stable in the small. 



This example also throws light on the domain of convergence of 
the series of exponentials. It is easy to see from the exact solution 
of (31) that the stationary level (y = 0, x = 1) is stable for all P os |" 
tive displacements from equilibrium, i.e., y(0) > 0, #(0) > 1, an 
also for all negative displacements which are less than unity 
in absolute value (— 1 < >'(0) <0, 0 < :e(0) < 1)- But t e 
series expansion of (41) is convergent only for |iT| < 1, or for 
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(— i < y(0) < + «,?< *(0) < «>) * Thus, its domain of con 
vergence is smaller than the region of true stability It provides 
only a lower limit which is not generally at the same time an upper 
limit to the region of stability 

In Figure 4 is depicted the solution of this equation for all 
possible initial conditions *(0) The stationary level of unity is 
seen to be stable, that of zero unstable The diagram brings out 
one feature not yet mentioned The lower and upper branches 
each approach an asymptote for finite values of t This means 
that for negative displacements around the zero level the system 
recedes from equilibrium at an infinite rate of speed after some 
finite time has elapsed For the upper branch it may be loosely 
said that the system approaches equilibrium after "coming in from 
infinity at an infinite velocity ” 

This example also suggests what is doubtless a valid "separa 
tion” theorem Points of stable equilibrium (in the small) are 
separated by points of definitely unstable equilibrium, and vice versa 
(In the interpretation of this, stable or unstable equilibrium may 
be of higher order so long as positions of one sided stability- 
instability are ignored ) 

The Problem of Higher Order Stability 
Thus far I have considered only "simple" stationary states, 1 e , 
those with power series expressions m which the first degree term 
does not vanish I now turn to "degenerate" stationary states 
those corresponding to multiple roots of the equation 

x = 0 = f(X), (43) 

where 

d = 0 (t = 0, ,71-1, nm 2) (44) 

In the neighborhood of such a root the differential equation takes 
the form 

x = a„x n + a B+ iX n+1 -f , (45) 

7 The expansion breaks down at the level at which the curve has an inflection point 
m = 0 = / {y(t)\y 

l venture the conjecture completely unverified that this may be a general phenomenon 
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t! g2. 

The solution of the equation 


x = a n x n , (46) 

where higher powers are neglected, can be expected by analogy 
with the previous case to dominate the remaining terms for suffi- 
ciently small initial displacements of equilibrium. In the investi- 
gation of simple systems this was analytically deduced (viz., 
Theorems I and II); for higher-order systems this conjecture has 
not been verified, but is doubtless true. We wish to investigate, 
therefore, the stability of solutions of equation (46). If they are 
all stable (unstable), we shall say that the equilibrium position in 
question possesses wth-order stability (instability). 

By elementary methods of integration we can find the exact 
solution to (46) ; thus 

x~ n dx = a n dt, 

x '-n = (1 _ n )a n t + x(0) 1_ ". ^ } 

To solve explicitly for x two cases arise depending upon whether 
n is odd or even. 


~ {a„(l - n)t + x{ O) 1 -"} 1 '*"-” 

1 

xW ~ ± («„( 1 - n)t + x(0) 1-n } ’ 


for n even ; 

(48) 

for n odd ; 


where the appropriate sign is to be taken so that x satisfies the 
initial condition x = x(0). Both cases may be subsumed under 
the heading 

x(t) = {sgn x(0) j 71 M1 _ n)t + x(0) i-n } mn^- (49) 


Clearly if n is even, ( n — 1) is odd. Therefore, the second 
term within the brackets takes on either sign depending upon 
whether the initial displacement is plus or minus. Consequently, 
regardless of the sign of a n , for some value of t, and for some dis- 
placement, the denominator will vanish, which means that x be- 
comes infinite, and the equilibrium is not stable. Actually for n 
even the equilibrium possesses one-sided stability-instability. If 
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*(0) is of the same sign as a n the motion will be unstable, if *(0) 
is of opposite sign to a K it w til bo stable Thus we have one-sided 
stability instability 

If n is odd then the second term of the denominator is always 
positive If and only if a n is negative (so that the first term is 
positive) will the denominator stay of the same sign and approach 
infinity as t approaches infinity Hence, vve have the following 
Theorem III If the first nonvamshtng coefficient is odd and nega 
hie, the system is stable in the small, if the first nonvanishing coefficient 
is odd and positive, the system is unstable If the first nonvanishing 
term is even the system possesses one sided stability instability 
This presents a strong analogy to the necessary second ir} con 
ditions for a maximum Write the differential equation 

A k = f(X) = F( X), (50) 

where 

F(X) = f" f(X)dX 

Then the following sums up the results achieved 

Theorem IV (o) If F(X*) affords a relative maximum to F, 
then X° is a stationary solution of the differential equation and 
possesses stabthty in the small, and conversely 

(6) If F(X*) affords a relative minimum to F then X° is an 
unstable equilibrium level 

(c) If X° is a stationary value of F(X°) which is not an extremum, 
then the system possesses one sided stability instability Allerna 
lively if F assumes a stationary value and F' = / assumes an ex 
tremum value, the equilibrium is stable unstable 

( d ) If F'iX) vanishes identically, the equilibrium is neutral 
This possibility of linking up the problem of stability with a 

statical maximum problem is but one special aspect of the Corre 
spondence Principle and one to which we shall have occasion to 
refer again 

An Example of One Sided Stability- Instability 
Malthusian and Optimum Population Theories 
The meaning of one sided stability instability may not be in 
tuitively obvious fortunately a simple well known economic ex 
pie may be used to illustrate it According to Malthus popula 
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tion would increase, decrease, or remain stationary depending upon 
the per capita level of subsistence (real income, food, etc.). Let 
X = total population, S = per capita real income. Then the 
percentage rate of growth of the population is an increasing func- 
tion of the level of subsistence, passing from negative values to 
positive values at some “minimum level of subsistence,” S°. 
Mathematically, 

X~dt = ( 51 ) 

where 

f'(S) > 0, *(s°) = 0. 

But the level of income (production) depends itself upon the 
level of population (labor) as of given capital, land, and technology. 
Moreover, Malthus implicitly and explicitly assumed the law of 
diminishing (per capita) returns. Thus, 

5 = <f>(X), <t>'(X) < 0. (52) 

This last relationship enables us to eliminate 5 as a variable and to 
express the rate of growth of the population in terms of itself. 

jfir '*{*(*) 1 =f{x) ’ (53) 

where 

f = W < 0 , 

and a stationary level X° corresponds to 

f(X°) = t(S°) = 0. (54) 

The equilibrium is stable because f(X°) < 0. If the popula- 
tion exceeds the equilibrium level, each family will receive less than 
the subsistence level, and the population will decline. If it falls 
below the equilibrium level (through war, etc.), the average income 
will be high, and the population will increase. In Figure 5 the 
minimum level of subsistence is shown by MM'. Below this level 
population will decrease as shown by the long arrow; above this 
level it will increase as shown by the other long arrow. R'R is the 
returns curve, and the intersection E\ represents a position of 
stable equilibrium as shown by the short directed arrows. 

More modem theory suggests the possibility that there may be 
increasing returns in the early stages. In this case there may be 
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two intersections between the returns curve and the minimum sub 
sistence level, yielding two points of equilibrium In the diagram 
the new returns curve is R"R, and the new intersection point is £, 
The new equilibrium point is unstable since here 

/ « > 0 (55) 

If population falls below this level it will become extinct since as 
returns fall off so does population etc , etc 



Figure 5 

About a decade ago the theory of an optimum population 
achieved a certain vogue According to one form of this theory, 
at some intermediate point average returns would be at a maxi 
mum If by education one could raise the minimum standard of 
comfort insisted upon by all families to this maximum level, then 
population would reach this optimum equilibrium level Without 
entering upon the merits or demerits of the scheme, I should like 
to point out that such an equilibrium level possesses one sided 
stability instability For displacements from equilibrium towards 
a larger population it is stable since such a movement lowers re 
turns and causes population to decrease towards equilibrium But 
for negative displacements of population it is unstable, since these 
also lower returns and cause population to decrease still more until 
the point of extinction is reached In Figure 5 the minimum level 
of subsistence (comfort) is irreversibly pushed up by education 
exactly to NN' so that its intersection, E 3 with R"R represents a 
maximum of per capita real income The arrows indicate the 
one sided nature of the equilibrium point 
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LINEAR AND NONLINEAR STABILITY 
Analytically, 

f = 0, (56) 

and the first nonvanishing derivative is even and negative. Thus, 
/ is at an extremum, and Theorem IV, (c), applies. The equi- 
librium is one-sided. 8 

Systems of Equations in “n” Variables 
Definition: The system of n differential equations 
ii = f(x i, • • •, x„) = Y. a ijXj + Z dijkXjXk 4- • • •, 

i 3>k 

(* = !,•••,») (57) 

where all summations range from 1 to n, will be said to possess a 
simple stationary solution (0, • • • , 0) providing that the matrix 
(an) possesses n (distinct, nonvanishing) roots (Xi, • • •, X n ), which 
are not connected by a linear commensurability relation of the form 

i\hi + * 2 X 2 + • • • + *nX„ = 0 (58) 

for any set of integers in • • • , i n , not all zero. 

In this case a fundamental set of solutions of the equations in- 
volving only linear terms 

n 

ii — E Q'ijXj (59) 

i= 1 

can be written in the form 

aie ui , a 2 e X!/ , • • * , a n e x "', (60) 

where the X’s are the latent roots of the matrix (&;,') or roots of the 
secular equation 

a 11 — X O 12 ’ ' ’ din 

®21 dll — x • • * c 2 n 

D(\) = = (Xi — X)(X 2 — X) • • • (X n — X). 

(61) 

dni a n 2 * * ■ a nn X 

8 I need not warn the reader against the lack of realism of the above theories in 
light of modem demographic trends. For large parts of Western Europe and North 
American where net and gross reproduction rates are low, perhaps no level of real income 
can lead to a stationary population. Moreover, these rates may fall with increasing 
real income; this, however, is presumably an irreversible effect. 
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Then we have the following theorem 9 
Theorem V A formal solution of the set of differential equations 
ts provided by a power series in the solutions of the first order equa 
tions, 1 e 

y = G (one?* cue™, a^) 

= £ c ,(«/ ') + £ e„*W x +x *’' + () 

! 1 k 

By formal substitution this can be verified remembering that the 
noncommensurability relationship (58) is satisfied Each set of c s 
can be determined in terms of previous sets and the known a s 
The matrix a may have (pairs of) complex roots corresponding 
for linear systems to damped or undamped sine cosine terms The 
system will be said to have first order stability if the real parts of 
all roots real or complex are all negative since this will imply 
damped motion (exponential or harmonic) of the linear system 
An important part of the stability problem is the determination of 
necessary or sufficient conditions that all real parts are negative 
and this will be dealt with in Mathematical Appendix B 

Here I should like to mention the generalization of Theorem II 

Theorem VI (a) First order stability is a sufficient condition for 
stability in the small, (5) the absence of first order instability is a 
necessary condition for stability in the small 

This follows because the series (62) can be shown to converge 
for all values of t and values of a sufficiently small if all the real 
parts are negative Since it converges for limited values of a and /, 
and since all terms are decreasing m absolute magnitude it con 
sequently never ceases to converge 

A stationary equilibrium position is not 4 simple if a com 
mensurabihty relationship of the form mentioned holds Even if 
such a relationship does exist, the above theorem is valid providing 
there are no zero roots If no roots vanish linear commensura 
bihty relations introduce into the infinite power series terms of 
the form 

p*(t)e ( x + +** x * + * (63) 

where p„(t) is a polynomial If the real parts of all X's are negative, 
the exponential will dominate and the solution will still be stable 
•Cf Picard op at BirkhofT op cil 



LINEAR AND NONLINEAR STABILITY 301 

Zero roots or ones whose real parts are zero, i.e., pure imagi- 
naries, cause greater difficulties since stability in the small becomes 
dependent upon terms of higher degree. 1 am not aware that this 
has been completely analyzed in the mathematical literature except 
for special cases. I shall not, therefore, enter upon the problem 
except to prove a general theorem relating to those many variable 
systems associated with the maximum of some function. 

Before doing so I may summarize briefly the results achieved 
so far as follows: Stability in the small of a nonlinear system of 
differential equations depends except in singular cases upon the 
stability of a linear system. This dependence can be rigorously 
defined and does not involve a dubious neglect of the squares of 
small quantities, etc. 

The Stability of a Stationary Position 
WHICH IS ALSO A MAXIMUM 

If (Xi° , • • • , X n °) yields an isolated relative maximum to a twice 
differentiable function F(X 1 , • • •, X n ), then it is not hard to show 
by the theorem of the mean that 

r) F 

= FAX ,• • • • , X„‘) = 0 , (i = 1 , • • • , ») ( 64 ) 

and, assuming we have an isolated stationary value, 

F 1 (X 1 , ■■;Xn)(X i - X 1 ") 

+ Ft(Xu • • • , Xn){Xt - x 2 °) + • • • <0 ( 65 ) 

for values of X sufficiently close to X°, but distinct from it. 

Suppose we are given a system of differential equations 

~ = /’(X„ •••,!„) = Wi, • ' •.*«)• ( 66 ) 

Only special differential systems can be so written. Unfortunately 
space does not permit a discussion of necessary and sufficient condi- 
tions satisfied by such special systems. 

Theorem VII: (X t °, • • •, X n °) is a stationary solution for the 
above system, and it is stable in the small. 
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Transforming the equilibrium point to the origin (0, , 0), 

we have 

^ = Ft tSf,” + , X,- + *„), (67) 


and £ F%x, < 0 for sufficiently small nonzero values of x 


Multiplying the first equation of (67) by Xu the second by x i( 
etc , and summing we find 


Lx, 


dx f 



± F,x, < 0 

i 


( 68 ) 


For sufficiently small values of x, the sum of squares is decreasing 
Hence as t goes to infinity it approaches a limit which cannot be 
different from zero If 


hrn (x, 1 + x 2 * + + x n 2 ) = 0, 


(69) 


then 

lim xi = 0 Iim x 2 = 0, 


lim x n = 0 (70) 


Hence, the equilibrium is stable A relative proper minimum 
yields definitely unstable equilibrium, while a nonextremum sta 
tionary value yields stability instability 

This theorem while not applicable to all differential equations, 
is nevertheless very important for economic systems Within its 
scope it is exceedingly general since it does not require that the / s 
be analytic, and it covers simultaneously the stability of first and 
higher orders Economically it says that the system is always 
climbing a hill and will approach the top 

The Dieeerence. Equ&xuiv. in qne. Yabiablr 
The problem of differential systems has been analyzed in a fairly 
complete manner, and we must now turn to systems of difference 
equations, which are perhaps of even greater importance for eco 
nomic theory The simplest case is provided by the general non 
linear difference equation in one variable 

X{t + 1) =f{Xm, (71) 
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or 

AX(t) = g[X(t)} =f- X(t). (72) 

So simple is this equation that we can indicate its solution 
graphically, showing on one diagram all possible types of qualita- 
tive behavior in the neighborhood of an equilibrium position. In 
Figure 6 two functions are plotted — one relating the succeeding 



value of X (vertical axis) to the given value of X (horizontal axis) ; 
the other is simply a 45° line. 

The solution of our equation for any initial condition is shown 
by broken lines running between these two functions in the indi- 
cated fashion. Any initial value X ( t ) leads to a new value X(t + 
indicated by running up vertically to the curve f{X(t)} ; this 
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value must be transferred to the abscissa in order to derive its 
successor This is effected by moving horizontally to the 45° line , 
a vertical movement yields the next value, and so the process goes 
Stationary equilibrium positions are defined by the intersection of 
the / function and the 45° line, or analytically by roots of the 
equation 

f(X) - X - 0 (73) 

The diagram illustrates what are essentially all possible types 
of equilibria Twenty four points of equilibrium are indicated 
but only sixteen represent qualitatively different hinds of equi 
librium, eight being duplicates Point A represents a position of 
equilibrium stable m the small Displacement in either direction 
results m an asymptotic return to equilibrium Point B repre 
sents unstable equilibrium displacement is followed by an ever 
increasing one way disequilibratmg movement C represents a 
position of stable equilibrium differing from A in that it has first 
order neutrality and has only high order stability D possesses 
first order neutrality but high order instability therefore it is 
unstable in the small, and the recession from equilibrium is mo 
notonic E has first order neutrality but its next nonvanishing 
derivative is e\ en Consequently, it has one sided stability 
instability 

Thus far we have met only equilibria like those of the one 
variable differential system That the single difference equation is 
richer in types of equilibria is illustrated by the variety still to 
come F possesses first order stability and hence is stable in the 
small Unlike A the approach to equilibrium is not monotonic, 
but rather by means of damped oscillations of period two G 
represents simply a position of neutral equilibrium in the small, 
possessing first and high-order neutrality The system stays 
where\er it is put H differs from F only m that the equilibrium 
is unsta6l*e, the divergence from equifibnum being along explosive 
oscillations, I is simply a duplication of B, as are K, M, 0, Q, S, 
U , and W 

J represents a position of neutral equilibrium The system 
oscillates with constant amplitude around this position (In phys 
ics this would be called stable equilibrium In the terminology of 
the previous chapter, it has stability of the second kind — not to be 
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confused with second- or high-order stability.) L like J possesses 
first-order neutrality, but unlike J has high-order stability. Equi- 
librium is approached by damped oscillations. N also has first- 
order neutrality, but possesses high-order instability of an oscilla- 
tory nature. P and R like L and N possess high-order, oscillatory 
stability and instability, respectively, but analytically differ slightly. 
T possesses oscillatory neutrality of all orders, and hence is neutral 
in the small. It can be thought of as including J as a special case. 
Finally, V and X, while they possess first-order stability, are 
analytically of singular type and should be differentiated. The 
former has monotonic stability, the latter oscillatory stability. 
The following classification may clarify the possible types. 

{ Nonsingular A 

Singular V 
High-order C 
f First-order B 

Monotonic' Unstable - 

High-order D 
Stable-unstable E 


Neutral G 


Stable 

Oscillatory 

Unstable 


f Nonsingular F 
First-orderi 

[ Singular X 
f Type 1 L 
High-orderj 

l Type 2 P 
First-order II 

Type 1 N 

High-order- 

Type 2 R 


Neutral J, T 
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We discriminate analytically between the sixteen cases depend 
ing upon the first and higher derivatives of f(X) The first sub 
division between monotonic and oscillatory hinges upon whether 
f(X) is positive or negative Within the monotonic classification 
we ha\e first order stability if /' is less than unity and first order 
instability if /' is greater than unity If /' equals unity, we have 
first order neutrality and must proceed to higher derivatives If 
all these derivatives vanish we have true neutrality in the small 
tf the first nonvamshmg derivative is odd and positive, we have 
high order stability, if odd and negative, we have high order in 
stability If the first nonvanishing higher derivative is even, we 
have one sided stability instability 

Within the oscillatory classification things are even more com 
plicated If /' is less than unity in absolute value, we have first 
order stability, if greater than unity in absolute value, we have 
first order instability If /' = — 1, then we must proceed to 
higher derivatives Should all of these vanish we have neutral 
oscillatory equilibrium (7) More generally, if all odd higher 
derivatives vanish, the equilibrium is neutral and oscillatory as 
in T When /' = — 1 and the first nonvanishing derivative is 
odd and positive, the equilibrium possesses high order oscillatory 
stability if odd and negative, the equilibrium possesses high order 
oscillatory instability When/' = — 1, and the first nonvanishing 
derivative is even then we go on to consider the first nonvanishing 
odd derivative As in the previous cases, we have stability or 
instability depending upon whether this is negative or positive 

Finally, we are confronted with the singular cases when/' = 0 
If the next nonvamshmg derivative is odd and positive, we have 
monotomc stability, if odd and negative, or if even and of any sign, 
we have stable oscillatory equilibrium If all derivatives vanish, 
the equilibrium is perfectly stable as indicated m the first section of 
this chapter, when displaced the system returns “instantly ’ to equi 
libnum and does not merely approach equilibrium asymptotically 

A few remarks may be in order concerning the qualitative be 
havior of a first-order system from any initial condition It could 
no doubt be shown that it must do one of the following (a) go off 
to infinity, (b) approach an equilibrium level, or (c) approach a 
periodic motion of some finite period If it is reversible, that is 
if f(X) is not only single valued but admits of a single valued m 
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verse, then the only periodic motion possible under heading (c) is 
one of period two. It is not appropriate to investigate here the 
meaning of the stability of motions more general than stationary 
equilibrium levels. When this is done, it will be found no doubt 
that there exist valid “separation” theorems concerning the order- 
ing of stable and unstable periodic motions. The necessity because 
of continuity (Rolle’s theorem, etc.) of duplicating certain equi- 
librium points in the above diagram provides a hint at such 
relationships. 


Analytic Solution 

As with the differential equation systems an exact solution of 
the general nonlinear analytic difference equation can be indi- 
cated. Let 

x(t + 1) = ai x(t) + a 2 x(t) 2 + a 2 x(t) 3 + • • •, (74) 

or 

A x(t) = («i — 1 )x(t) -f- a 2 x(t) 2 -f a 3 x(t) s -}-•••. (75) 

We rule out cases where the first-degree term vanishes in either of 
these expressions; and we rule out all cases of singular and neutral 
first-order equilibrium so that oj ^ 1, 0, or — 1. 

A formal solution of this is given by a power series in the solu- 
tion of the simpler linear equation 

x(t T 1) = ®i %(l) > (76) 

A x(t) = (ai — l)x(t). ' 

Let 

gi(t) = aaf. (77) 

Theorem VIII: A formal solution to (74) is provided by 

x(t) = Ci{aai‘] + c 2 {q;Ox ( } 2 + Cslaaf} 3 + (78) 

This may be verified by formal substitution provided that oq 5^ 1, 0, 
or — 1. Each c is obtainable in terms of all previous c s and the 
known a’s. 

More generally, we are given n difference equations in normal 
form 

xft + 1) = 2 ebijXj(t) -j- D &ijkXj(t)xk(t) + • ■ • . (79) 
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where the latent roots of a, (Xj, , A,), are never equal in absolute 
value to zero or one, and where there exist no linear commensura 
bihty relations of the form 

nit log Ai + tnt log Xj + + m n log A* = 0, (80) 

for in's integers not all vanishing Then 

Theorem IX A formal solution to the system of difference equa- 
tions is given by an ascending power senes in the solutions of the 
linear system 

x t (t + 1) = Z (81) 

or 

*•(0 «=* • <* nW ) 

= + Lcu*l«,x/M«*x*‘l + K ; 

t j * 

Each set of c's can be determined by formal substitution from all 
previous sets and the known a’s If some of the roots are multiple, 
or if a linear commensurabthty relation of type (80) does hold, 
there will probably exist a similar power-series solution augmented 
by terms with polynomial multipliers in t, provided always that the 
absolute value of all roots equals neither zero nor one 

Alt of the remarks of earlier sections concerning convergence of 
such series apply We must remember, however, the first order 
stability of a difference equation system implies 

|X,| < 1, (83) 

and conversely The following theorem is easily derived along 
now familiar lines 

Theorem X For a difference equation system first order stability 
is a sufficient condition for stability in the small, and the absence of 
first order instability is a necessary condition 

Space does not permit me to illustrate the above remarks with 
an economic example, such as would be provided by the familiar 
cobweb theorem applied to nonlinear supply and demand curves 

Other Functional Equations 

While this has not been verified, one can hazard the conjecture 
that more general nonlinear functional equations can m the neigh- 
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borhood of an equilibrium point be expressed as a power series in 
solutions of the simpler linear systems. Thus, under suitable as- 
sumptions functional equations can be written in the Taylor-like 
expansion 

*.«) + WiV), ••■,X n '(r)S 


= (X,(/) - X/} + Z P K l} (t - r){X 3 (r) - X,o\dr (84) 

J -02 

+ e r r K 1JS (t ti, t — r 2 ){z 3 (n) - x ; °} 

^-oo */— 00 

(X 8 (t 2 ) — X 4 °}^Ti^T 2 + • • •, 

where K l} , K lja represent first and second functional derivatives 
respectively. The linear system 

X t (t) - X t ° + L f Kij(t - t ) - X,*\&r = 0 (85) 

j t/-co 

is known to have solutions of the form 

X&) - X t ° = £ a 1J e > "', (86) 

1 

where the X’s are infinite roots of the transcendental equation 

D(\) = \8 tj + K tJ (v)e-*’dv | = 0. (87) 

•'O 

In population analysis single integral equations of this type play an 
important part, and the solution in terms of an infinite number of 
exponentials has been called by A. J. Lotka 10 the Hertz-Herglotz 
solution of these equations. It can perhaps be shown that an 
infinite power series in the infinite solutions of the linear system 
will provide a solution for the nonlinear case. 

Similarly mixed equations of the type 

y'(t) = f{y(t), y(t - 0) } (88) 

may be expressible in terms of the Frisch-Holme linear equation 
y'(t) = ay(t) + by(t — 0). (89) 

10 For an excellent bibliography of applications see A. J. Lotka, “A Contribution 
to the Theory of Industrial Replacement,” Annals of Mathematical Statistics, vol. X 
( 1939 ). 
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As in the preceding integral equation, boundary or initial con- 
ditions for this equation necessarily involve an arbitrary function 
over an interval, and hence the exponential solutions must be in- 
finite in number so that an arbitrary function can be expanded in 
terms of them. (The same would be true of difference equations 
not solely defined for integral values of t. Arbitrary periodic func- 
tions would enter into the solution, and these would be expressible 
in the Fourier infinite exponential series.) This raises problems of 
doubly infinite series and cannot be entered upon here. 



CHAPTER XI 

SOME FUNDAMENTALS OF DYNAMICAL THEORY 

In the previous two chapters I have examined a number of eco- 
nomic examples of historical interest drawn from the fields of theory 
and business cycles, and have attempted to suggest the importance 
of dynamical analysis not only for its own sake, but as an aid in 
arriving at useful theorems in the realm of comparative statics. 
Now with less use of illustrative economic material, I should like 
to examine analytically the formal aspects of this problem, at the 
same time indicating possible directions of generalization. 

Statics and Dynamics 1 

Often in the writings of economists the words “dynamic” and 
“static” are used as nothing more than synonyms for good and bad, 
realistic and unrealistic, simple and complex. We damn another 
man’s theory by terming it static, and advertise our own by calling 
it dynamic. Examples of this are too plentiful to require citation. 

Some writers attempt to distinguish between statics and dy- 
namics by analogy with what they understand to be the relation- 
ship in theoretical physics. That this is a fruitful and suggestive 
line of approach cannot be doubted. But it is too much to suppose 
that very many economists have the technical knowledge necessary 
to handle the formal properties of analytical dynamics. Conse- 
quently, they become bogged down in the search for economic con- 
cepts corresponding to mass, energy, inertia, momentum, force, 
and space. A case in point is Professor Frank Knight’s otherwise 
stimulating essay on Statics and Dynamics . 2 

It is certainly true, notably in the writings of Marshall , 3 that 
economists have made use of biological as well as of mechanical 

1 With a few alterations the following section is taken from my article, "Dynamics, 
Statics, and the Stationary State,” Review of Economic Statistics, XXV (1943), 58-61. 

2 Chap, vi of The Ethics of Competition (New York, 1935). This is an English 
translation of an article in Zeitschrifl fiir Nalionalokonomie for 1930. 

3 See the references to statical method, and biology in the Index to the eighth edition 
of the Principles. In none of his writings does Marshall show more than a passing 
familiarity, such as might be expected of any intelligent layman, of the biological notions 
of his time. Therefore, he could not be expected to have discerned the lasting truths 
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analogies, in which evolution and organic growth is used as the 
antithesis to statical equilibrium analysis. In general the results 
seem to have been disappointing; viz., the haziness involved in 
Marshall’s treatment of decreasing cost. And if one examines the 
more exact biological sciences, one looks in vain for any new 
weapon, secret or otherwise, for discovering scientific truths If 
the bloodstream is capable of a simple, abstract, rigorous descrip- 
tion in terms of the usual laws of physical thermodynamics, so much 
the better, if not, one must be content with more complicated, un- 
wieldy explanations. Indeed, according to the late L J . Henderson 
the very notion of a stable equilibrium, so characteristic of physical 
theory, was actually first observed empirically in connection with 
the resistance of the human body to disease, and formulated by 
the ancients as the well-known mx medicalnx naturae 4 

Nor should the problems encountered in the biological field be 
considered necessarily more complex and less subject to simple 
formulation than those in the physical field Few biological 
sciences are less "exact” than meteorology, which must certainly 
be included in the physical sciences . 6 Here, simple and abstract 
theories spun out from a few assumptions are likely to be inferior 
to the intuitive hunches of experienced practitioners, but this is 
only a reflection of the present primitive state of the subject. 
New truths are ascertained in the same way as in more advanced 
subjects, and it is to be hoped that rule of thumb may be replaced 
by more exact and unambiguous formulations 6 

from the fashions of the moment Nevertheless, writing at the time that he did, it was 
inevitable that he should have been influenced, if not convinced, by the Spencerian 
doctrines popular at the end of the nineteenth century. 

* See J. A Schumpeter, The Theory of Economic Development (Cambridge, Mass , 
1934, English edition), Preface, p xi, for Mill’s notions of statics and dynamics and their 
intellectual origins 

* Of course, it can be said that experimentation is not possible in meteorology as in 
other physical sciences But what about astronomy, in some ways the most exact of all, 
in which no experimentation is possible? 

* In discussing the limitations of mathematical methods in economics, Professor 
Viner expresses the belief that the biological character of the subject, so to speak, makes 
such methods of limited applicability By this I take him to mean that the subject is 
complex and difficult, not that any fundamentally different methods of investigation are 
required. See “Marshall's Economics, the Man and His Times,” American Economic 
Review, XXXI (1941), 223-236 

Gustav Cassel in his Fundamental Thoughts tn Economics (New York, 1925), 
chap i, considers Economic Dynamics to be a third stage of analysis, following a pure 
Static Economy and a “quasi-static” Uniformly Progressive Economy 
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Leaving aside all analogies with other fields, there has neces- 
sarily been within the main corpus of economic theory a preoccupa- 
tion with dynamics, if only implicitly. The Classical economists 
from Smith through Mill had theories of the long-run movements 
of population and accumulation . 7 J. B. Clark rigidly separated 
the static from the dynamic in his thought . 8 (Other examples 
could be multiplied.) Clark’s celebrated static state and the 
“circular flow” of Professor Schumpeter raise a rather vexing point 
of terminology — the relation of static to stationary — now cleared 
up more or less to the satisfaction of everybody by Professor Frisch. 

Stationary is a descriptive term characterizing the behavior of 
an economic variable over time; it usually implies constancy, but 
is occasionally generalized to include behavior periodically repeti- 
tive over time. Used in this sense, the motion of a dynamical 
system may be stationary: e.g., the behavior of a pendulum satis- 
fying Newton’s Laws of Motion, but subject to no disturbance and 
hence remaining at rest; or the behavior of national income after 
a change in investment has given rise to dwindling transient geo- 
metric progressions of the usual “block-diagram” character. 

Statical then refers to the form and structure of the postulated 
laws determining the behavior of the system. An equilibrium de- 
fined as the intersection of a pair of curves would be statical. 
Ordinarily, it is “timeless” in that nothing is specified concerning 
the duration of the process, but it may very well be defined as 
holding over time. A simple statical system as defined above 
would also have the property of being stationary; but, as we shall 
see in a moment, statical systems can be devised which are not 
stationary over time. 

In defining the term dynamical, at least two possibilities suggest 
themselves. First, it may be defined as a general term including 
statical as a special rather degenerate case. Or, on the other hand, 
it may be defined as the totality of all systems which are not statical. 
Much may be said for the first alternative; the second, however, 
brings out some points of controversy in the literature and will be 

7 See L. Robbins, “On a Certain Ambiguity in the Conception of Stationary Equi- 
librium,” Economic Journal, XL (1930), 194-214. 

8 J. M. Clark has wished to carry on where his father left off, to construct a dynamics 
which would supplement statics. See J. M. Clark, A Preface lo Social Economics (New 
York, 1936). 
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discussed here This decision mvoh es no point of substance since 
only verbal problems of definition are involved 

We may say that a system is dynamical if its behavior over lime 
is determined by functional equations in uhich variables at different 
points of time are iniohed m an essential nay This formulation 
is to be attributed to Professor Fnsch • Special examples of such 
systems are those defined by difference equations 1 c those in 
volving a variable and its lagged \ alues integral equations in which 
the preceding values of the variable enter m a continuous wa> 
By a liberal interpretation of the circumlocution \anable at a 
different point of time we may bring differential equations under 
the definition remembering that differential coefficients character 
ize the behavior of a function in the neighborhood of a point 
Mixed types and more general functionals arc included 

Attention is called to the fact that a ambles at different points 
of time must enter into the problem in an essential wa> Thus 
a system involving a rate of production per unit time 1 c a time 
derivative may >et be statical This is because the variable of 
which the rate is the time derivative may have no economic sig 
mficance It can be interpreted as the cumulated amount of pro 
duction from the beginning of time or from an initial date no 
significant economic process depends upon this variable The nc 
cessity for the present insistence may be apprehended if it is realized 
that every variable can be written as the derivative of something 
namely its own integral Moreover a system may be pseudo 
dynamic in the sense that formal manipulation of it permits us to 
reduce it to statical form Unless therefore we reserve the dcsig 
nation dynamics for systems which involve economically significant 
variables at different points of time in an irremovable way we shall 
find that no nondynamic systems exist 

According to the present definition the historical movement of 
a system may not be dynamical If one year the crop is high be 
cause of favorable weather the next year low and so forth tl e 
system will be statical even though not stationary The same is 
true of a system show mg continuous grow th or trend if the secular 

’ Ragnar Fr sch On the Not on of Equ 1 br um and D sequ 1 brium Ret «t> of 
Economic Studies III (1935 36) 100-106 
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movement is taken as a datum and if the system adapts itself 
instantaneously. 10 

On the other hand, a truly dynamical system may be com- 
pletely nonhistorical or causal, in the sense that its behavior de- 
pends only upon its initial conditions and the time which has elapsed, 
the calendar date not entering into the process. For many pur- 
poses, it is necessary to work with systems which are both historical 
and dynamical. The impact of technological change upon the 
economic system is a case in point. Technological change may be 
taken as a historical datum, to which the economic system reacts 
noninstantaneously or in a dynamic fashion. Another instance 
is provided by a business cycle of a regular periodic character, 
which results from impressing an oscillatory outside force upon a 
mechanism with an intrinsic (damped) period of its own. 

We may distinguish, then, four distinct cases made up of all 
possible combinations of static-dynamic and historical-causal: 

1. Static and stationary 

2. Static and historical 

3. Dynamic and causal (nonhistorical) 11 

4. Dynamic and historical 

Almost all systems can be placed in one of these categories; and, 
depending upon the point of view or purpose at hand, the analysis 
may be formulated so as to put a given system arbitrarily in one 
category rather than another. Thus, if a system is very heavily 
damped so that it approaches its equilibrium value extremely 
rapidly, its dynamic features may be passed over in order to 
simplify the analysis. 

Or a system which is causal from a very broad viewpoint may 
be regarded as historical if certain movements are taken as unex- 
plained data for purposes of the argument. (In fact, every his- 
torical system is to be regarded as an incomplete causal system.) 

10 1 conceive Henry Moore’s moving equilibrium to be of this statical type, although 
the movements around the secular trend are dynamic in character. H. L. Moore, 
Synthetic Economics (New York, 1929). 

11 The notion of causation in a closed interdependent system is exceedingly slippery 
and ambiguous. As used here, a system is said to be causal if from an initial configura- 
tion it determines its own behavior over time. While it is not appropriate to say that 
one subset of variables causes another to move, it is permissible to speak of a change in 
a given parameter or datum as causing changes in the system or in its behavior over time. 
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To a meteorologist economist a business cycle caused by weather 
disturbances and sunspots would constitute a causal process But 
ordinarily the economist is willing to regard the causation as um 
lateral and to adopt a division of labor in which he does not study 
astronomy but considers his job as done when he has pushed 
economic analysis to a “noneconomic" cause 12 However, there 
is nothing sacred about the conventional boundaries of economics, 
if the cycle were meteorological in origin, economists would branch 
out in that direction, just as in our day a political theory of fiscal 
policy is necessary if one is to understand empirical economic 
phenomena 

An important class of phenomena cannot conveniently be 
brought under the above four categories I refer to dynamical 
stochastic processes such as that realized if a damped pendulum is 
subjected to "random’ shocks We shall have reason to discuss 
such processes in connection with dynamical problems arising in 
the study of the business cycle 

For a given pattern of shocks as determined by the particular 
workings of chance in the sequence of time under discussion, we 
have simply a dynamic historical system of type 4 above But if 
we consider the totality of all possible shocks which may be ex 
pected to occur if they are regarded as random draws from a fixed 
universe, it is clear that calendar time is not really involved, but 
only the time which has elapsed since the beginning of the process 
In this sense it is like type 3 rather than type 4, although the word 
causal may no longer seem appropriate 

It seems best, therefore, to specify two more categories 

5 Stochastical and nonhistoncal 

6 Stochastical and historical 

The latter occurs when we have a dynamical system containing 
stochastical variables and where either the structure of the system 
vanes m an essential way with time, or where the universes charac 
tenzmg the random vanables change in an essential way with tune 
Simple examples of types 5 and 6 are provided by 

Xt+i = \Xt + {Ail. 

Xi+i = *Xt + / + 

** J A Schumpeter Theory of Economic Development chap i 


( 1 ) 
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where in the first case the random variable h which may appear at 
any instant of time is drawn from an unchanging universe; and 
where in the second case the random variable m is drawn from a 
universe defined differently at each instant of time. 13 

In his recent book Value and Capital , u Professor Hicks has 
given an exceedingly simple definition of dynamics: “I call Eco- 
nomic Statics those parts of economic theory where we do not 
trouble about dating; Economic Dynamics those parts where every 
quantity must be dated” (p. 115). 

In terms of the above six categories this definition is overly 
general and insufficiently precise. The second category consisting 
of historically moving static equilibria would certainly require 
dating of the variables; but it would not thereby become dynamic. 
My objection is to his definition, not to his practice, for many of 
the systems which he analyzes are in the strict sense dynamic. 

Causal Systems 

By a complete causally -determinate system I shall mean one be- 
longing to the third category of the above classification, whose be- 
havior is determined by initial conditions (in the broadest sense) 
in such a way that its behavior depends only upon the time which 
has elapsed since the establishment of such initial conditions. 
That is to say, the specification of similar given initial conditions 
at a later period of time would result in a similar evolution of the 
system except at a constantly later time period. Mathematically, 
if x represents initial conditions at time t°, our solution takes 

13 In the above form the initial formulation of stochastic processes seems to be due 
to G. U. Yule. See the references to the work of Yule, Slutsky, and Frisch in H. Wold, 
A Study in the Analysis of Stationary Time Series (Uppsala, Sweden, 1938). However, 
the notions of a Markoff Chain, a Brownian movement, etc., go far back in the literature 
of mathematics. See S. Chandrasekhar, “Stochastic Problems in Physics and Astron- 
omy,” Reviews of Modern Physics, XV (1943), 1-89. 

As is noted later, many systems which involve stochastical processes can neverthe- 
less be described in part by nonstochastical models of the first four types discussed above. 
Often probabilities and statistical parameters may themselves be treated as causally 
determined coordinates. Thus, if it is known that the first system described in equation 
(1) takes on the value k at time 0, and that the mean and standard deviation of h are 
respectively a, b, then the expected (mean value) of Xt is given by the solution of a 
causal dynamical equation like that of (1) with the stochastic term removed. 

11 J. R. Hicks, Value and Capital (Oxford, 1939). 
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the form 

( 2 ) 

where x and x stand for a finite or even infinite set of variables 
Without this restriction our system would be a historical one 
depending in an essential way upon the time at which the initial 
conditions are specified, and could only be written 

(3) 

Thus, the differential equation 

x + * = 0 (4) 

does not contain time explicitly, and its solution is of the form 

x(0 *=» (5) 

while 

x 4- x = t (6) 

has the solution 

*(/) = (* - 1) + [x(r) + 1 - (7) 

The first is a complete causal system , the second is not. For 
the first a specification of the initial conditions (value of f) and 
a knowledge of the time which has elapsed is sufficient to locate the 
quantity x, in the case of the second it is necessary as well to know 
the historical date at which the initial conditions are preassigned. 16 

11 This represents a one parameter, not a two parameter, family of curves A change 
in f introduces no new solutions, but simply alters the initial conditions l{l°) which 
identify each 

“Complete causal systems are essentially unchanged by the transformation 

f = t + a, 

but become historical ones in the time scale defined by the transformation 

f - *(0, h"(t) ^ 0, 

although for simple scale changes dimensional constants serve to maintain all essential 
invariances Conversely, m special cases there may exist transformations (of time) 
which convert a historical system into a causal one, via , the historical system 



becomes after the transformation 

t — e'', where 1' = log t 
a complete causal system in the new time t , inmely. 
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It should perhaps be explicitly mentioned that no metaphysical 
significance attaches to this definition. Not only is an arbitrary 
time scale involved, but also any historical system may be regarded, 
as we shall later see, as an “incomplete” causal one. Furthermore, 
the elements of our system may very well be probabilities, which 
are themselves determinate although the value of certain variates 
may be uncertain. 

Merely from the definition of a causal system certain interest- 
ing properties become obvious. If within a finite period of time a 
system returns to exactly the same initial conditions from which it 
started, its motion must be perfectly periodic, since it must again 
do the same, etc . 17 Then again, a determinate economic system 
cannot on the downswing pass through precisely the same con- 
figuration as it did on the upswing. Some of the relevant variables 
(costs, prices, etc.) must present a different pattern or else relevant 
dynamic aspects of them must (time derivatives, lagged values, etc.). 

Let us now consider some definite causal determinate systems. 
An investigator interested in only certain aspects may wish to 
concentrate upon a particular subset of variables, neglecting their 
mutual interdependence with the totality of all. To a first ap- 
proximation the excluded ones may be assumed to be arbitrarily 
given functions of time. The system studied is essentially an “in- 
complete” causal one and as such is indistinguishable from any 
other historical one. Thus, given three bodies, one of large mass 
compared to the other two and of great distance from them, it is 
often convenient to analyze the behavior of the two smaller ones 
on the assumption that they are in a gravitational field changing 
with respect to time (as the external body pursues its “relatively” 
independent path) ; although in fact the three bodies together form 
a gravitational field independent of time. 

Which variables will be taken as data, and which as unknowns 
to be analyzed, will depend in each case upon the purpose at hand 
and upon a diagnosis of the particular interrelations present. 
Often the economist takes as data certain traditionally noneco- 
nomic variables such as technology, tastes, social and institutional 
conditions, etc. ; although to the students of other disciplines these 
are processes to be explained and analyzed, and are not merely 

17 G. D. Birkhoft and D. C. Lewis, Jr., "Stability in Causal Systems,” Philosophy 
of Science, II (1935), 304-333. 
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history 18 On the other hand one may limit the scope of one’s 
inquiry within the economic sphere Thus, in studying a hypo 
thetical short m\ entory cycle one might take longer w aves as given 
Then again, it is often necessary to go beyond the realm of tradi 
tional economic variables m throwing light upon a particular proc 
ess especially often, for example, into the political sphere The 
w orld rarely fits into the taxonomic classifications of pedagogues 

Among the numerous reasons for electing to regard certain 
■variables as independent or arbitrary parameters, one in particular 
deserves attention in connection with dynamic analysis Certain 
processes are conceived to move slowly as compared to others 
Thus we distinguish long run tendencies from shorter run tenden 
cies, and so forth in infinite regression The interplay of these 
processes will be given in more detail later 

Stationary States and their Generalization 

In chapter ix a stationary or equilibrium state of a dynamic 
system of n variables defined by n functional equations of the 
general form 

P[xi(r), , x» (r), *3 = 0 (1 = 1, , n) (8) 

was provided by a set of constants (xi®, , x*°) for which 

,xl°,/) = 0 (9) 

Even if the system is not a complete causal one, 1 e , if it m 
volves time explicitly, it may still possess one or more stationary 
equilibrium levels Thus, the system described m footnote 16, 
p 318, is historical, but has the stationary equilibrium level x s 0 
However, only m exceptional cases will a historical system possess 
stationary equilibrium positions, and as pointed out in chapter ix, 
even complete causal ones may lack such positions It is desirable, 
therefore, to consider the manner in which a stationary equilibrium 
can be generalized 

If the variables of our system are "dated,” and involve time 
explicitly, but are not truly dynamic in the Frisch sense of involv ing 
variables at different points of time (derivatives, integrals, and 

u See J A Schumpeter, The Theory of Economic Development chap 1 
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more complicated functionals), the manner of generalization is 

obvious. Our historic (but Frisch-static) system is of the form 

-F(-X) t) F (# 1 , • • x n , t ) = 0. 19 ( i = 1, • ■ • , n) (10) 

A solution of this system 

X = X‘{t) (11) 

such that 

0 = 0 ( 12 ) 

may be termed the moving equilibrium. However, this is almost 
trivial since the moving equilibrium is the only possible evolution 
of the system. 20 Let us turn, therefore, to truly dynamic historical 
systems. 

For definiteness, let us contemplate the dynamical system as- 
sumed by H. L. Moore in his pioneering attempt to determine em- 
pirical supply and demand elasticities. While he adopted a general 
equilibrium approach in which the quantity of each good depends 
upon all prices, the formal difficulties of the problem will be lessened 
if we concentrate upon a partial equilibrium market involving only 
one good q and its price p. 

The demand for the good is a functional relationship between 
its quantity, its price, and time, 

2* = Dipt, t). (13) 

Quite inadvertently, and probably for the purely statistical purpose 
of deriving the elasticity of supply simultaneously with the demand 
elasticity, Moore assumed that the quantity supplied depended 
upon time and price at a previous period vf time, 

qt = S{pt-i, t ). (14) 

The two equations together form a dynamical system which 
determines the evolution of ( p , q) for given initial values of p or q; 

13 1 shall have frequent recourse to matrix notation whereby a set of variables 
(xi,---,x n ) will be represented by (X), a set of functions [Xxi, •• - , *„)] 

by f{X), etc. Thus, f(X) = 0 implies /‘(xi, - ••,*„) = 0, where (t = 1, •••,«). 

50 Quite possibly one might imagine random or chance errors causing deviations 
from the equilibrium defined by the equations (10) so that the moving equilibrium repre- 
sents a smoothed out trend of the actual observations. One might have f(x -f - e, i) = 0, 
where e is a random variable. 
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namely, 

q* = iO, 


(IS) 


There exists a one parameter family of motions, each motion is 
determined by efficacious forces, but not all can be termed moving 
equilibria or the term loses significance 

If time did not enter explicitly into our equations, i e , if neither 
the demand curve nor the supply curve were shifting, the stationary 
equilibrium would clearly be defined by 


D(p-) - ? = 0, 

s(p-) - <r = o 

For this steady motion, and here defining Ap t = p t — p , , 


(16) 


Pt = Pt-, = = P°, 

q, “ g«-i = = q 9 , (17) 

Apt s A?, h 0 

When time is explicitly mvolved what is the analogous position 
of moving equilibrium ? Let us consider the following alternative 
definitions which have been advanced (1) the definition which 
Moore seemed to urge whereby the position of moving equilibrium 
is to be represented as a statistical trend, (2) the moving equi- 
librium defined by the ‘‘equality’' of supply and demand, (3) the 
moving equilibrium defined by the 'Frisch barring process ” to be 
described later 

1 The representation of equilibrium as simply a statistically 
fitted trend has apparently no universal validity In particular, 
if this method is applied to the causal case (especially where the 
approach to equilibrium is nonoscillatory), it need not lead to the 
correct equilibrium level, nor necessarily to any stationary level 
Probably its apparent acceptance by Moore rested in part upon its 
supposed relation to the criterion to be described next For 
stochastical systems, discussed later, trend fitting is more de 
fensible, but still not optimal 

2 The criterion that moving equilibrium be defined by equi 
librium of supply and demand seems on first sight a natural general- 
ization of the stationary case But, upon examination, ambiguities 
appear in this formulation Given any price demand reacts m 
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stantaneously, supply after one year. What sense is there in 
equating the two? (Of course, each year in the shortest run price 
is determined by the equality of demand and short run supply for 
all possible motions.) Nevertheless, if the shifts are very “slow,” 
one might as a first approximation disregard the differences be- 
tween p t and pi - 1 and equate 

Dipt, t) = Sip,-* t ) = S{p t - Ap t , t) = S(p t , t) (18) 

on the assumption that Apt = 0. Solving 

Dipt, t) - S(p t - 0, t) = 0, (19) 

we get 

Pt = Pxit), (20) 

and this might be termed the moving equilibrium for price. Para- 
doxically, on the assumption that price is not changing, we derive 
a moving equilibrium path for price! But this is characteristic of 
the method of successive approximation. Assumptions known to 
be in error are made, later to be corrected. A second approxima- 
tion would be derived by making 

A p t = Ap\(t) (21) 

and equating 

D[p 2 (t), q - S\j 2 {t) - A p l {t), Q = 0, (22) 

and the nth approximation by the relation 

D[p n {t), q - s[p n {t ) - a p^t), q = 0 . ( 23 ) 

If the sequence of functions [£,-(/)] converges uniformly to a limit 
function pit), then p(t) is a solution of our original system. 

The method outlined above is essentially identical with the 
moving equilibrium of a biological or chemical system undergoing 
slow changes. 21 If the system is defined by 

x = f(x, t), (24) 

where ft is small, the moving equilibrium (first approximation), 
is defined by 

= 0 = fix 1 , t); x 0 = constant; (25) 

21 See A. J. Lotka, Elements of Physical Biology (Baltimore, 1925), chap. xxi. This 
contains numerous references. 
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the second approximation x t (t), by 

*1 -/(*. t), (26) 

and the nth by 

« f(x n t) (27) 

The sequence of functions £*»(0U ma y or Ilia y not converge uni 
formly to a limit function, if it does converge the limit function is 
one solution of the original (Instead of a single variable any 
number could be involved without affecting the argument ) 

The above definition has the following advantages in its favor 
For nonhistoncal causal systems not involving time explicitly, it 
does yield the correct positions of stationary equilibrium at the 
first approximation Also the successive approximations are in 
variant under changes of variables of the form 

y =/(*), x~f~\y) 

and changes of time 

t' = t = g _, (0 

Nevertheless, these advantages do not outweigh the overwhelm 
mg shortcomings involved in this definition What after all is 
being approximated ? Suppose all solutions of equation (24) were 
known exactly to be of the form 

x = W,x) (28) 

There would then be no need for approximations at all Which of 
these solutions could legitimately be termed the moving equihb 
rmm ? The particular method of successive approximation out 
lined above, if it converges at all, selects out arbitrarily and 
mysteriously the particular solution to be dignified by this title “ 
I reject, therefore, the above method of approximation as defining 
a unique moving equilibrium, although it may be appropriate for 
axtivuis at paxticular solutions 

For a linear differential equation of the form 

x - o(t)x = bit), (29) 

the general solution equals any particular solution plus the general 

n The success ve fund ons m the approximation sequence x ( l ) do not m general 
satisfy the same initial conditions so that only in the limit can one determine which 
initial conditions the final solution attained will satisfy 
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solution of the reduced homogeneous equation 

x - a(t)x = 0. ( 3 0 ) 

Let 

* = «t(f) (31) 

represent the general solution of (30), where c is a parameter 
specified by initial conditions. Let v x (t) be a particular solution 
of (29). Then 

»i (0 + cu(t) (32) 

is the general solution of (29). Clearly also 

[01 (t) + au(t)~\ -f- ku(t) (33) 

is a general solution since the expression in brackets is a particular 
solution for a specific a. 

In some fields (electrical network theory, etc.) special signifi- 
cance is attached to that particular solution which “contains” no 
terms of the form [ \au(t) ], and it might be thought that this might 
be an acceptable definition of moving equilibrium. Where the 
functions involved are not the simple elementary ones, the above 
criterion does not specify unambiguously a unique function. But 
it can often be modified to do so. In fact, the limit function 
defined by convergent approximation sequences of the type de- 
scribed in the previous section often represents this particular solu- 
tion. I am not aware, however, of any useful purpose to be served 
by embracing this definition. 23 

There is one interesting interpretation of the first approximation 
which deserves detailed attention. One often encounters the no- 
tion of an equilibrium which, before it is reached, recedes because 
of historical perturbing forces. Imagine a boy carrying a stone at 
the end bf a piece of string. If he were standing still, equilibrium 
would be attained when it hung vertically at rest. But as he walks 
the stationary equilibrium is never attained, the position in which 

23 In linear systems (electrical, etc.) where there is an impressed periodic force, the 
general solution of the differential equations can be written as the sum of a purely 
periodic motion and a transient. In the case of damped systems the transient necessari y 
goes to zero in the limit, and the motion necessarily approaches the purely periodic func- 
tion. For some purposes it might be convenient to define the purely perio ic unction 
as a “moving equilibrium” and give it privileged treatment, particularly i t e perio 
function is a pure sine wave. 
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equilibrium might be possible if his motion were to stop being 
constantly displaced 

We have here what may be termed receding equilibrium For 
the time t it is the position of stationary equilibrium which might 
be attained if subsequent to t all historical changes were suspended 
That is to say, in all our functional equations a bar is placed over 
t wherever it occurs explicitly for times subsequent to t The result 
is a hypothetical or virtual complete causal system For a new t 
we have a corresponding virtual equilibrium and thus is defined 
our receding equilibrium as a function of time 

Let us examine a simple example to illuminate this concept 
Let 

x + x = t (34) 

be a historical system At t equal to t the receding equilibrium 
is the stationary equilibrium corresponding to the causal system 

f + *M - t, (35) 


where the right hand member is treated as a constant Clearly, 
such a position is given by 

0 + x = 1 (36) 


But this is precisely the first approximation x t (f) of equation (25), 
specifically in this case 




(37) 


The virtual path of the system after the freezing of historical 
change is only one possible solution For other initial conditions 
at time t different solutions will be defined In particular, let us 
consider initial conditions u(l, x ), where « is a true solution of the 
historical system (34) The corresponding virtual path may be 
written 

x *= v [t, «(<, a;)] (38) 

For the specific system defined by (34) 

u(t, x) = t — 1 + (x + 
v[t, u(t, i)] = (-j'+i+ l)e~‘ 4- 1 


(39) 

(40) 
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The actual solution of the historical system is seen to be an envelope 
to the family of virtual solutions v[t, u(t, x)]. The virtual move- 
ment for short times approximates closely the true movement in 
that they are mutually tangent. That is, if the boy were suddenly 
to stop walking, the pendulum would change its motion; but the 
new motion would be tangential to the old, and for short intervals 
they would not diverge much. 

The virtual curves approach asymptotes or hypothetical sta- 
tionary equilibria. These equilibrium values plotted against time 
form the curve of receding equilibria. This function of time is not 
in general a solution of the true dynamical equations. 24 The sta- 
bility of the hypothetical equilibrium can, however, be determined, 
and, as will be seen later, bears some relation to the stability of the 
actual motions. 

3. I turn now briefly to the “normal” values or “equilibrium” 
values defined by Professor Frisch. 26 This is defined by relaxing 
a subset m of our n functional equations 

• • • , *„‘(t), H=0, (* = 1, ---,») (41) 

-w -® 

and replacing them by a set of m hypothetical equations 

g*'(* 1 , • • • , Xn, t ) = 0. (42) 

—00 —co 

In the new set of n equations the instantaneous forms of the vari- 
ables (at least in some of the places where they occur) are barred 
and regarded as unknowns. Everywhere else (in all dynamic 
forms of the variables and possibly in some places where the in- 
stantaneous forms appear) is substituted one solution of the original 
set of functional equations defined for a particular set of initial con- 
ditions. This yields n historical equations of the special form (10) 

24 It Is possible for the receding equilibrium to be one solution as, for example, in 
the system 

V + y = *• 

y — l is the receding equilibrium and at the same time an actual motion of the system. 
If the boy has always been walking at a steady pace, the pendulum may be hanging 
vertically. 

25 R. Frisch, “On the Notion of Equilibrium and Disequilibrium,” Review of 
nomic Studies, III (1936), 100-105. 
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in the barred variables, (xj, • , x,). This defines the moving 

equilibrium. 

Professor Frisch is well aware that this definition does not 
correspond in the stationary case to the concept of equilibrium; he 
even thinks that the former will eventually replace the latter “as 
the tendency to formulate the economic reasoning in exact dynamic 
mathematical terms gains ground.” 28 But this concept, he be- 
lieves, is what many modem writers have in mind. 

Whether or not this last conjecture is true, I do not know, par- 
ticularly since I am unacquainted with the original Scandinavian 
writings. 27 But for the present purpose this is irrelevant; it is 
only important to point out the differences between this concept 
of a “normal” and the moving equilibrium for which we are looking. 

Some of the salient properties of the "normal” movement are. 

(1) It is in general not a solution of the original set. There 
exists no set of initial conditions which would produce it in actuality. 

(2) It is not unique, but depends upon as many parameters as 
are necessary to specify initial conditions for the original Set of 
equations This is because it depends upon the particular solution 
— out of an infinity of possible solutions — inserted in the modified 
set of equations / and g. (See the italicized sentence above.) 

(3) Being defined by essentially static, albeit possibly historical 
equations, there is no sense in asking questions concerning its 
stability. 

For all these reasons it is unsatisfactory as a representation of 
moving equilibrium for our present purposes s * 

"Ibid, p 102 

17 Professor Frisch illustrates this concept by referring to the WickseUian relation 
between the actual and natural rate of interest. In so rationalizing the arguments of 
some neo-Wicksellians he is. I fear, over charitable, ascribing to them an undeserved 
degree of sophistication Actually, Wicksetl himself considered for the most part a 
system of implicit dynamic relations not involving time explicitly, i e , without historical 
change (Witness, for example, his controversy with Professor Davidson on the equi- 
librium requirement of stable prices in a system undergoing output expansion because 
of irreversible technological change ) Within a nonhistorical framework it is possible 
to construct dynamic models of the WickseUian s>stem in which the natural rate of 
interest represents the stationary equilibrium level of the system and not the solution 
of a hypothetical alternative system If this be granted, what is the corresponding 
condition for a system in which elements of historical change have been introduced? 
This brings us back to our original query. 

11 Still another notion of moving equilibrium is suggested by an explosive exponential 
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Resolution of the Problem 

We have examined in turn alternative definitions of moving 
equilibrium and found good reasons for rejecting each one. This 
suggests a reexamination of the purpose for which it is desirable to 
find a concept of moving equilibrium. 

It will be remembered that in the case of nonhistorical causal 
systems there was found to be no ambiguity in the definition of 
a stationary equilibrium state ; in fact, it was the hope of generaliz- 
ing the notion of a stationary state for historical systems that 
motivated the search for moving equilibria. 

Let us go back a step and ask why we were interested in a 
position of stationary equilibrium. Clearly it is only one out of 
an infinity of possible motions of the dynamical system in question. 
But, and this suggests the answer to our question, if the stationary 
equilibrium is stable, all motions approach it in the limit P 

We were concerned then with the behavior of all the motions 
of the dynamical system ; and it was simply in the nature of a co- 

solution of a causal system such as 

y - y = 0. 

This has solutions of the form 

y = ke‘. 

On the above definition every motion but the stationary equilibrium one would be a 
moving equilibrium. By definition all the moving equilibria would be unstable. The 
effect of small shocks would be multiplied through time. On the other hand, the variable 
( y/y ) would be relatively stable. 

The above concept which applies, for example, to a population with constant specific 
fertility and mortality is in contrast to the moving equilibrium represented by the 
growth curve of an infant. Given a small shock (measles, etc.), the child’s weight will 
deviate from “normal” growth, but afterwards will catch up so that no evidence of the 
interruption will remain in the later progress of the child. Similarly, an economy re- 
covers from the effect of (say) a war and continues on its secular path. But the popula- 
tion referred to above is forever different in absolute numbers after a shock (war, 
etc.) even though its relative age distribution and other features may again approach 
stable form. 

28 Let x°(t ) = x° represent the equilibrium position of a system. Let x'(t) represent 
any other motion. Then, if stable, 

lim x’(() = x°; 

/-*eo 

i.e., for any given positive «, however small, there exists a 1 ° such that 
\x'(t) — < e, for l > 1 °. 

But this is symmetrical in x‘(l) and x° so that the equilibrium position can be said to 
approach any motion, as well as vice versa. It easily follows that *’(/), any motion, 
approaches x l (t), a particular motion selected arbitrarily. 
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incidence that this problem could be studied by an examination of 
the properties of a single special one 1 1 will perhaps be enlighten 
ing to point out that even in the nonhistoncal case we can just as 
well concentrate upon any other single motion than the equilibrium 
one For if they all approach the equilibrium motion then they 
all approach each other And every motion must approach any 
one selected at random 

This suggests the following answer to our dilemma We shall 
concern ourselves not with the stability of a particular motion of a 
historical system which can be given the privileged title of moving 
equilibrium but rather with the stability of each and eiery motion of 
the system This will not preclude our designating for special at 
tention particular motions with special properties (as for example 
strictly periodic motions steady motion in which some coordinate 
increases linearly etc ) 

This suggests how I would answer the purely verbal problem 
what processes shall be designated as equilibrium processes? Once 
we have stripped the equilibrium concept of normative and teleo 
logical connotations it does not much matter how we apply the 
term The term may be applied to stationary \ alues only From 
this narrow point of view a competitive industry would be in equi 
librium only after all long run conditions had been satisfied entry 
of firms correct output for each firm etc 

For other purposes we might apply the term to those inter 
mediate short run situations in which each firm is producing at 
price equal to marginal cost (and above average variable cost) 
even if price diverges from average cost so that the number of 
firms is changing or is about to change It might be reserved for 
situations of the last described type where the further condition is 
realized that the number of firms is changing at a causally deter 
mined rate Still others may be willing to apply it to momentary 
conditions of supply and demand including variations in speculative 
stocks of the good in question 

I, myself find it convenient to visualize equilibrium processes 
of quite different speed some very slow compared to others 
Within each long run there is a shorter run and within each shorter 
run there is a still shorter run and so forth in an infinite regression 
For analytic purposes it is often convenient to treat slow processes 
as data and concentrate upon the processes of interest For ex 
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ample, in a short run study of the level of investment, income, and 
employment, it is often convenient to assume that the stock of 
capital is perfectly or sensibly fixed. Of course, the stock of capital 
from a longer run point of view is simply the cumulation of net 
investment, and the reciprocal influence between capital and the 
other variables of the system is worthy of study for its own sake, 
both with respect to a hypothetical final equilibrium and the simple 
course of growth of the system over time. 

So to speak, we are able by ceteris paribus assumptions to dis- 
regard the changes in variables subject to motions much “slower” 
than the ones under consideration; this is nothing but the “per- 
turbation” technique of classical mechanics. At the same time we 
are able to abstract from the behavior of processes much “faster” 
than the ones under consideration, either by the assumption that 
they are rapidly damped and can be supposed to have worked out 
their effects, or by inclusion of them in the dynamical equations 
(derivatives, differences, etc.) which determine the behavior of the 
system out of equilibrium. 

The first of the above mentioned alternatives constitutes the 
justification for the use of comparative statics rather than explicit 
dynamics. • If one can be sure that the system is stable and strongly 
damped, there is no great harm in neglecting to analyze the exact 
path from one equilibrium to another, and in taking refuge in a 
mutatis mutandis assumption. Of course, if one chooses to neglect 
certain dynamic processes, one may still retain others; e.g., in 
studying capital formation over two decades T may choose to 
neglect inventory fluctuations, but still may retain the acceleration 
principle in its secular aspects. 

Under the second' alternative where shorter run processes are 
contained in (say) the differential equations of the system, it is to 
be understood that these differential equations do not necessarily 
hold exactly at each instant of time. There may well be a still 
shorter run theory which explains how still higher differential equa- 
tions lead to (rapidly) damped approaches to the postulated differ- 
ential equation relations. And so forth in endless regression. 

It may be argued that so general a connotation is at variance 
with traditional usage of the word equilibrium. Is it not straining 
language to think of a cannon ball as being at equilibrium, not 
only after it has fallen to the ground at rest, but also at every point 
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in its flight, when it is on its mean trajectory as well as in its pre 
cession around this path? Perhaps such terminology may occa 
sionally lead to confusion , however, with carefully stated qualifica 
tions it may be convenient 

To examine the stability of any motion [u t (/) , «»)3 simply 

substitute into the functional equations 

*.(<) - ».« + •?.«) (43) 

where the above equation is simply a definition for the 17 ’s There 
result n functional equations in the 17 s since the u s are given func 
tions Because the «'s constitute a solution of the original system 
there exists a solution in the 17 s of the form (0 0, , 0) The 

original solution will be stable if for all possible initial conditions 

hm 17 (0 = 0 (» = 1 n) (44) 

Moreover provided the original functional equations arc of a 
very general class (differential equations, difference equations, in 
tegral equations etc ) the resulting equations can for sufficiently 
small displacements be regarded as linear m the jj’s They arc not 
however independent of time, but ordinarily will involve it 
explicitly 

This is most easily seen for a single variable Let our implicit 
functional equation be 

JWr) + ,(r) 0 = 0, (45) 

where 

= 0 (46) 

because u is a solution Providing certain assumptions are made 
concerning the continuity of the functional / and its functional 
dftcwatwis, <a£ Uvstvtt wAm, tVere. crafts av. esprvwEM a®, 'resy mudx 
like Taylor’s expansion for ordinary functions of a point This 
takes the form 

fWi + I7(r) 0 “ 0 + J Ki(t, Ti)r](Ti)dri 4 - , (47) 

where K 1 is the functional derivative of / This seems to result 
only in linear integral equations, but by use of the Stieltjes integral 
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or the Dirac operator 8(x) with the property that 

f /(*)$ (* ~ a)dx = f{a), 

♦-/ — on 


and 


f f(x)8'(x - a)dx = - f(a), 

— co 


(48) 

(49) 


the general linear differential and difference equations can easily 
be written as linear integral equations. Thus, 


takes the form 

di + = 0 

(50) 


J ' K(t, T)x(r)dT = 0, 

(51) 

where 




*^ s ‘ 

I 

F- 

+ 

1 

to 

1 

II 

T' 

(52) 


The convertibility of the problem of stability at least in its 
first order aspect into an examination of linear systems is of in- 
estimable value since the largest part of existing mathematical 
knowledge relates to such systems. 


Concepts of Stability 

In chapter ix there was some discussion of various kinds of 
stability. Our treatment would not be complete without at least 
a cursory survey of the various senses in which this term has 
been used. 

(a) It has sometimes been used in a very broad sense. Any 
position of equilibrium is stable if deviations from it remain 
bounded. If no motions go out to infinity, then each is stable. 
For many purposes this is unsatisfactory. Better names than 
stability can be suggested for this property. An egg balanced on its 
tip upon a level plain would be stable in the above sense. 

(b) Poincard implied still another definition when he coined 
the phrase stability in the sense of Poisson. “This means that, 
although the system may not exactly repeat itself starting from an 
arbitrary initial state, yet in general it will return to the vicinity 
of its initial state and nearly repeat its motion during a long interval 
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of time.” 40 This is quite different from the usual meaning of 
stability, and the term recurrent is perhaps more appropriate to 
systems employing this property. 

(c) A very common use of stability in nonconservative physical 
systems is the one which I have termed stability of the first kind. 
It holds when every motion approaches in the limit the position 
of equilibrium (and every other motion). It is not reversible in 
time; going backwards, all stable systems become unstable . 41 

Subdivisions under this heading are of course possible. Thus, 
there is stability of the first kind in the small if in a sufficiently 
small neighborhood of a given motion all motions arc stable. First 
order stability of the first kind prevails when certain sufficient condi- 
tions hold; namely, when the linear terms in the expansion of our 
functional equations taken by themselves yield a system which is 
perfectly stable We have been concerned almost exclusively with 
this definition of stability. 

(d) All conservative physical systems are reversible in time, and 
volumes are preserved in phase space. This rules out stability of 
the first kind. A conservative system free of the dampening dissi- 
pative force of friction, if displaced from stable equilibrium, never 
returns to rest at stationary equilibrium A pendulum displaced 
from the perpendicular has higher total energy (equal to potential 
energy plus zero initial kinetic energy) than the equilibrium posi- 
tion since the latter represents a minimum Total energy must 
be conserved, so, in passing through the equilibrium position, the 
system must possess motion in order to provide some kinetic 
energy to be added to the minimized potential energy. 

(e) There remain still other notions such as permanent stability 
(the previous definition holding for all time between minus and 
plus infinity), semi-pcrmanent stability in which the above proper- 
ties hold for "long” periods of time, complete or trigonometric sta- 
bility in which the motion can be approximated by certain har- 
monic sums, etc . 42 

10 G D Birkhoff and D C Lewis, Jr , “Stability in Causal Systems," Philosophy 
of Science, II (1935), 310 

" This might be termed stabilit) in the sense of Ltapounoff. (See E Picard, Tratte 
D'Analyse, III, p 200 ) It is also shown there that instability of the first order rules 
out stability in this sense Birkhoff has termed this unilateral stability. 

” See G D Birkhoff, Dynamteal Systems (New York, 1927), chap iv, for more 
detailed discussion. 
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It is unnecessary to go further into this problem except to point 
out that unlike the greater part of the subjects treated here, the 
analysis of the concept of stability with the implied investigation 
of the qualitative behavior of generalized dynamical paths leads 
into some of the most difficult problems in higher mathematics. 

Nature of the Business Cycle 

We have seen that there are many interesting and fruitful as- 
pects of dynamics which have nothing to do with the business cycle 
as such, but which are important for the understanding of processes 
usually classified under the heading of economic theory. However, 
of all branches of dynamics the one which has received the greatest 
attention is that dealing with the fluctuations in employment, in- 
come, and general business activity. Numerous explanations, as- 
sociated with a variety of names, have been put forward. In fact, 
there have been so many different theories that it has been neces- 
sary to devise a number of different systems of classification in 
which to catalogue them (e.g., underconsumption, overinvestment, 
etc.; or frictional, institutional, monetary; exogenous versus endo- 
genous; etc.). I should like to survey these in a brief way in order 
to isolate the analytical differences involved rather than to concen- 
trate on the historical, institutional, and personal variations. 

(1) It is a commonplace today to employ the terms exogenous 
and endogenous in describing cycle theories. The former refers to 
theories which find the origin of the cycle in some outside, non- 
economic datum which varies in a quasi-periodic manner, and by 
one way causation engenders a cycle in economic time series. A 
sunspot or meteorological theory is usually offered as the prototype 
of this class. Inasmuch as the economic fluctuations whose ex- 
planation we are seeking are not themselves strictly periodic, or 
more than quasi-oscillatory, it is not really necessary that the 
exogenous factors be any more regular in their periodicity and 
amplitude. Nor need the exogenous factors he completely inde- 
pendent of reciprocal influence from the economic system; strictly 
speaking, it is only necessary that lines of causation be largely one 
way in direction, from the outside in. Analytically the extreme 
exogenous theory is analogous to a “forced” periodic motion, where 
the economic system reacts instantly to the external driving im- 
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pulse In terras of my earlier six categories the system is his 
toncal but static 

(2) At the other extreme there is the purely endogenous theory 
or the so called self generating cycle Various theories emphasiz 
mg monetary factors inventories the acceleration principle psy 
chology etc fall under this heading The determinants of the 
system consist of dynamic equations involving different periods of 
time (lags derivatives etc ) which generate recurrent motions 
Once the cycle is under way boom gives way to depression depres 
sion to revival revival to boom and so forth 

Some economists think that illegitimate circular reasoning is 
involved in this view that it shows how the cycle once started 
perpetuates itself but begs the question as to the original existence 
of the cycle Even if valid this would be an objection of no par 
tjcuiar consequence since there are an infinity of exogenous and 
chance factors which could give rise to the initial cycle Actually 
as will be shown below a cycle is not required to start off the 
process but only an initial displacement from equilibrium however 
small and noncyclic in character 

Endogenous Models 

Analytically the pure endogenous cycle is usually likened to 
the motion of a fnctionless pendulum which satisfies a simple 
Newtonian second order differential equation Upon closer ex 
animation difficulties appear with this notion In the first place 
all dampening must be ruled out or else the cycle will come to an 
end similarly in most theories anti dampening or explosive be 
havior is ruled out Now in a physical system there are grand 
conservation laws of nature which guarantee that the system 
must fall on the thin line between dampening and anti dampening 
between stability and instability But there is nothing in the 
economic world corresponding to these laws and so it would seem 
infinitely improbable that the coefficients and structural relations 
of the system be just such as to lead to zero dampening 

There is still another difficulty Even within an endogenous 
theory the economic system is not regarded as isolated It is sub 
ject to outside disturbances but these are not taken to be related 
to the business cycle in an important way If we consider a system 
which taken by itself is on the border line between stability and 
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instability, it may be said to be stable in the second sense discussed 
earlier. But if we now let random variations impinge on the sys- 
tem, it becomes unstable in the sense that the (expected) amplitude 
of its swings increases with time, and its variance approaches 
infinity in the limit . 33 

I think it ill-advised, therefore, that Kalecki 34 in an empirical 
determination of the coefficients of an assumed mixed difference- 
differential equation system should have imposed the conditions 
that the motion have zero damping. One can sympathize with 
his aim of deriving a system with constant amplitude of fluctuation, 
whatever one thinks of the empirical foundation for this hypothesis. 
But after imposing a highly improbable constraint, he did not 
achieve what he was looking for; namely, a system with constant 
amplitude of fluctuation. 

Equally important, the purely endogenous analyses so far dis- 
cussed are unable to provide an explanation for the exact or ap- 
proximate amplitude of the cycle. Like the pendulum system they 
are essentially linear in character, and every linear system can take 
on any amplitude depending only upon the magnitude and sign of 
the original displacements. Thus, a pendulum can be made to 
swing in a small or large arc by means of a small or large initial 
displacement. To explain the observed level of oscillation of the 
cycle the economist with a linear endogenous theory must go back 
to the value of the original prehistoric disturbance, and must ex- 
plain why succeeding shocks have not swelled the cycle. 

There are two avenues of escape from the fundamental diffi- 
culties encountered in the simple, linear, purely endogenous models. 
The first, which has been widely discussed in the technical litera- 
ture, involves dropping the assumption that the system possesses 
zero dampening, and placing reliance upon external shocks to keep 
fluctuations from dying down. This solution involves dropping 
the assumption of a purely endogenous system, but permits the 
retention of linearity assumptions. It is discussed in detail in 
later sections. It is to be stressed that the exogenous impulses 
which keep the cycle alive need not themselves be even quasi- 
oscillatory in character. 

33 On p. 343 below the sum of the A’s will not converge nor will the sum of their 
squares, where there is zero dampening. 

*» M. Kalecki, “A Macrodynamic Theory of Business Cycles,” Economelrica, III 
(1935), 327-352. 
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The other alternative is to drop the assumption of linearity even 
though this entails considerable mathematical difficulties For the 
qualitative problem of stability, relations 4 in the small were seen 
to have important implications as necessary conditions for stability 
or instability in the large ' In consequence it was useful and 
permissible to work with linear relations as a first approximation 
even though that approximation was admittedly not exact But 
in the realm of quantitative dynamics and economic fluctuations 
this approximation must be dropped if we are to get correct quan 
titative and qualitative results 

For linear systems are lacking in the qualitative richness of 
nonlinear systems The former are either damped or undamped 
stable or unstable regardless of the size of initial displacement 
Nonlinear systems for the first lime introduce a theory which accounts 
for fluctuations of a particular amplitude, independent of the initial 
displacement Thus a nonlinear system may possess an unstable 
stationary level so that when the equilibrium position is displaced 
in the slightest ev er increasing swings are begun But instead of 
oscillating to infinity, a particular amplitude is finally reached and 
maintained This periodic motion may be stable in the sense that 
any further disturbance will result in a motion which approaches 
the given periodic motion from above or below 

As an example of such a nonlinear equation, constder the 
differential equation 

x — /i(l — x*)x -J- x = 0 (S3) 

While zero is a stationary equilibrium level of the system an 
evaluation of the coefficients of the above equation for that level 
will show that the equilibrium is unstable For small departures 
from equilibrium the system behaves like an explosively oscillatory 
linear system but finally the nonlinearity asserts itself and the 
motion settles down to a fixed amplitude Geometrically, in the 
(x dx/dt) phase space the equilibrium point is given by the origin 
out from which flow expanding spirals However, each of these 
approaches a closed curve representing the periodic motion If we 
begin initially from a point outside the closed curve we have con 
tracting spirals which approach the closed curve from without 
The stability of the periodic motion in the small is verifiable by 
examining the solution of a variational linear differential equation 
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of the second order with periodic coefficients, derived by evaluating 
the coefficients of (53) along the periodic motion in question. This 
equation of variation in turn is seen to be stable because its 
“characteristic exponents or multipliers’’ all have negative real 
parts. 35 The uniqueness of the periodic solution is more difficult 
to establish, but in the case cited it has been done. 

Such nonlinear systems have received a certain amount of 
attention in theoretical mechanics under the headings of “self- 
exciting oscillations” or “relaxation oscillations.” 36 Much still 
remains to be done in this field. In the field of economics one 
nonlinear system has received complete treatment. I refer to the 
so-called cobweb theorem in which supply lags one period. In 
general, this leads to a nonlinear difference equation of the first 
order, which is so simple as to permit of a complete graphical 
solution. 

As we have seen in previous chapters, the linear cobweb case 
permits of only three possibilities: for any amplitude the system is 
either damped, anti-damped, or exactly in between. (Actually in 
real life, both in an economic or physical system, a movement to 
infinity is unthinkable. Therefore, if a linear system is unstable, 
its motion will grow until it no longer remains linear; its structure 
will "give,” etc.) Dropping the assumption of linearity, we find 
that the system may possess special periodic motions other than 
that of stationary equilibrium. Thus, with the equilibrium point 
unstable, there will be a stable “box” approached by all near-by 
motions, the size of the box determining the unique amplitude of 
the cycle. There may, of course, be several such periodic motions, 
each being alternately stable and unstable. In real life the last 
one can in most cases be taken as stable. 37 

33 See G. D. Birkhoff, Dynamical Systems, chap. iii. 

36 Equations of this type are sometimes known as Van der Pol equations after the 
name of the man who has devoted a good deal of attention to them. However, their 
history goes back into the 19th century at least. A partial bibliography can be found 
in T. Von Kantian, “The Engineer Grapples with Nonlinear Problems, Bulletin of the 
American Mathematical Society, XLVI (1940), 615-683. See also Norman Levinson and 
Oliver K. Smith, “A General Equation for Relaxation Oscillations,” Duke Mathematical 
Journal, IX (1942), 382-403; B. Van der Pol, “Relaxation Oscillations,” Philosophical 
Magazine, II (1926), 978-992. 

33 There is a vast literature on the cobweb theorem. The most complete treatment 
seems to be that of W. Leontief, “Verzogerte Angebotsanpassung und Partielles Gleich- 
gewicht,” Zeitschrift fiir Nalionalokonomie, Band V (1934). 
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Elsewhere m economic analysis there are to be found suggestions 
of nonlinear dynamics " However the formal difficulties of solu 
tion are so great th it very much remains to be done This is all 
the more important since a careful analysis of various literary 
theories will show that some of the most simple of them depend in 
in essential way upon nonlinear elements Among these is a wide 
class of what I have termed billiard table’ theories, which deduce 
a turning point of the cycle from such considerations as the fact 
that the system hits the full employment ceiling and bounces 
back so to speak ' Another of this type is the Hawtrcyan notion 
of the minimum reserve ratios of banks against which the system 
rebounds Such notions are less successful m explaining the lower 
turning point since there is no (relevant) natural bottom to the 
economic system S9 

It is not unexpected to find that the simplest empirical notions 
may lead to the most complicated mathematical problems This 
is a fact to inspire humility in both literary and mathematical 
investigators but should prove discouraging to neither 

Mixed Exogenous Endogenous Theories 

It is not necessary to subscribe exclusively to one of these two 
polar types Perhaps most economists are eclectic and prefer a 
combination of both For example an economist who believed m 
the reality of waves of different length might plausibly regard the 
long Kondratieff wave as being primarily exogenous in character, 
depending as it does upon wars gold discoveries, and great tech 
nological upheavals Its impact upon the system might involve 
endogenous transient cyclical movements but their movements 
might take place m a much shorter time period so that compared 

’* See Ph Le Corbe l!er Les Systemes Autoentretenus et les Oscillations de Retixa 
tion Economelrtca I (1933) 328 332 J Tinbergen Statistical Testing of Business 
Cycle Theories (Geneva League of Nations 1939) 

** Professor A. H Hansen mbs Fiscal Policy and Business Cycles (New York 
Norton 1941) chap xi i seems to regard the level at wh ch the average propensity to 
consume is 100 per cent as such a natural bottom I think th s is putting the matter 
much too strongly even though I agree that below this level certa n upward tendencies 
are brought into play R F Harrod s The Trade Cycle (Oxford 1936) places consider 
able emphas s upon nonlinear factors in connection with h s dynamic determinants 
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to the length of the Kondratieff cycle these might be regarded as 
“rapidly damped” and be neglected. 40 

On the other hand, the extremely short fluctuations, to the 
extent that they exist, might be explained almost wholly in endog- 
enous terms with reference to the dynamics of inventories, the 
acceleration principle, speculation, etc. These short cycles would 
perhaps die down were it not for the random and systematic dis- 
turbances received from the longer movements in investment. 

To illustrate the above remarks the reader may construct for 
himself a model in which there is superimposed a regular periodic 
movement of autonomous net investment upon a system in which 
both the multiplier and the acceleration principle are operating. 
The period of the autonomous movement should be longer than 
the intrinsic period of the (damped) responding mechanism. Spe- 
cifically, consider the equation 

Y(t + 2) - a (l + fi) Y(t + 1) + c#Y(t) = P(t), (54) 

where a is the marginal propensity to consume, /? is the so-called 
“relation” of the acceleration principle, and P(t ) is a periodic move- 
ment, not necessarily a pure sine wave. 41 

If the process has been going along for a long time, it will finally 
approach a periodic motion of income in which autonomous invest- 
ment shows a lead over income, and in which the amplitude of the 
final motion is proportional to that of P(t), the exact factor of pro- 
portionality increasing with the nearness of the intrinsic period of 
the left-hand side to the postulated periodicity of the right-hand 
side, i.e., with the approach to resonance. 

However, if we break in at an arbitrary point of time, there will 
be in addition to this motion a damped transient whose qualitative 
properties depend only on the endogenous response characteristic 

40 On the whole, we should rather expect greater irregularity in amplitude and 
periodicity in a cycle generated by exogenous factors than in one endogenous in character. 
This is in accord with the prevailing view that a diversity of factors is responsible for 
the few recorded “long waves” of economic history, and with the notion that there is 
much less predictability into the future of even the qualitative features of such a move- 
ment. On all these matters the reader may be referred to the well-known views of 
Schumpeter, Mitchell, Hansen, et al. 

41 Cf. P. A. Samuelson, "Interactions between the Multiplier Analysis and the 
Principle of Acceleration,” Review of Economic Statistics, XXI (1939), 75-78. E. G. 
Bennion has worked out a number of interesting arithmetical models in which the agree- 
ment between theory and the model sequences is complete. 
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of the multiplier-relation system. This can give shorter cycles, 
brought into life so to speak by the shocks incident to the longer 
wave. 41 

Mixed Systems of a Linear Stochastic Type 

Thus far I have been considering mixed exogenous-endogenous 
systems in which the exogenous forces are periodic in nature, or at 
least quasi-periodic. There is, however, a stochastical theory de- 
signed to explain the existence of quasi-periodic cycles by means of 
a damped system responding to random shocks The latter serve 
to keep alive the fluctuations of the system in spite of dampening. 15 
But in doing so they tend to displace the phase of the given motion 
so that ordinary periodogram analysis will not, in a long series, 
reveal a “significant” period in the vicinity of the intrinsic fre- 
quencies of the responding system. This is intuitively obvious if 
we think of ordinary periodogram analysis as Fourier analysis, and 
the latter as equivalent to fitting by least squares the best single 
harmonic to the time series in question. The ordinate of the usual 
periodogram is equal to that part of the total variance (in absolute 
or percentage terms) of the time series which can be explained by 
the best harmonic of that frequency Because of the constant dis- 
turbance of phase, in a long series no sine wave will give a good fit. 41 

It remains an open question whether other of the usual methods 
of time series analysis (counting distances between peaks and 
troughs, etc ) will suffice to give back the known periods even in 
artificially constructed model sequences of the type of equation (1) 

“ If we drop the assumption of linearity, then the response of the system will show 
some differences The amplitude of income will not be simply proportional to the 
amplitude of the function P(<), nor will the final solution be a simple additive combma 
tion of periodic and transient components. Qualitatively, however, the result will be 
a final approach to a periodic motion with characteristic shorter waves in the transition 
period Unless kept alive, these will die out 

*' H. Wold, A Study tn the Analysts of Stationary Time Series (Uppsala, Sweden, 
1938) 

M Cf. the reference given in footnote 43 See also the brilliant contribution of 
Professor Ragnar Frisch to the Cassel volume, "Propagation Problems and Impulse 
Problems in Dynamic Economics,’ Economic Essays tn Honor of Gustav Cassel (London 
1933), pp 171-205. While ordinary periodogram analysis will not do the “Generaliied 
Harmonic Analysis” of Norbert Wiener is designed precisely for problems of this type 
See the references to his 1930 Acta Mathemaltca article in H. T. Davis, The Analysis of 
Economic Time Series (Bloomington, Indiana Principia Press, 1942), and aLo the refer 
ence there to the 1935 article of Bartels, and the latter’s suggestive concept of the 
"harmonic dial " 
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above. It is clear, however, from the work of Slutsky 45 and others 
that the time series generated by such sequences resemble quali- 
tatively the usually encountered economic time series. 46 

By means of the analysis of the previous chapter it is clear that 
(except for terminal adjustments which become negligible in a long 
series) the solution of the damped dynamic stochastic system of 
the following type 

L(Y) = Y{t) + a y Y(t- 1) + ■ . . + a n Y(t - n) = {z t \, (55) 

where z is a random variable serially uncorrelated in time and 
drawn from an unchanging universe whose first two moments 
(0, a 2 ) exist, takes the following form 

Y{t) = AoZ(l) + A \Z(t — 1) + • • • -f- A„Z(t — n) + • • •. (56) 

As t grows the number of coefficients in this sequence becomes in- 
finite, but in such a way as to leave their sum and the sum of their 

n 

squares finite and equal respectively to (l/X) a,-, k ). By the usual 

0 

central limit theories it is easy to show that (except for terminal 
adjustments) the variance of Y(t) is given by 

V t = (Ao 2 + AY- + ••• +A t 2 )<rA (57) 

or in the limit by 

V„ = k<r 2 , (58) 

where k is finite. The actual frequency distribution of Y(t) tends 
to some limit with increasing t, with zero mean and variance equal 
to the last given expressions. This is usually true even if the orig- 
inal frequency distribution of z is not at all normal (i.e., Gaussian). 

The interested reader can satisfy himself that the superposition 
of a random variable on to a periodic force leads to a forced motion 

45 Eugen Slutsky, "The Summation of Random Causes as the Source of Cyclic 
Processes,” Economelrica, V (1937), 105-146. 

48 The article by Trygve Haavelmo, “The Probability Approach in Econometrics,” 
Economelrica, vol. XII, Supplement (1944), deals with the problems of determining 
empirically such stochastical relations. The article by H. B. Mann and A. Wald, “On 
the Statistical Treatment of Linear Stochastic Difference Equations,” Economelrica, 
XI (1943), 173-220, shows that the conventional least squares treatment of autocorrela- 
tion is (asymptotically) a "consistent” method of determining the a coefficients. It 
would be outside the scope of the present work to go into these problems. A number 
of articles in the Annals of Mathematical Statistics for 1942 deal with the sampling 
distribution of the autocorrelation coefficient. 
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just like that described above except that the mein of the asymp 
totically normal distribution fluctuates according to the periodic 
function described m the previous section He can also work out 
the implications of having a stochastic variable z t which is not 
serially independent 

Non Linear Stochastic Systems 
A more difficult problem is that of developing for a non linear 
system the stochastic theory which corresponds to that dealt with 
in the previous section As far as the present writer is aware this 
is almost completely unexplored ground No more than a bird s 
eye view of the problem can be attempted here 
We consider a non linear system of the form 

m - JIY{1 - 1) , Y{t - n), zm =* 0 (59) 

whose initial conditions may be written in the abbreviated matrix 
form 

r» = [7(-,)] (60) 

where i goes from zero to (« — 1) If desired the 7 s and Z's may 
be considered as column matrices of many xanables As before 
the Z refers to a random variable drawn from the same universe 
over time and without serial correlation Formally the solution 
may be written in the form 

7(0 = F,[Z(0 Z(t - 1), , Z(0), 7 0 ] (61) 

where the exact nature of F depends upon that of/ It has been 
shown in Mathematical Appendix B, Section 8 that with an ap 
propriate redefinition of the variables 7 and Z we may consider 
equations (59) to be of the first order, l e to depend only upon the 
value of 7 one period ago The new Z will now hav e zeros in some 
of its components, but as before there will be no serial correlation 
between successive Z s 

Under certain restrictions upon the partial derivatives off such 
as correspond to the economic reality of any relevant damped eco 
nomic system it should be possible to enunciate limit theorems not 
unhke those holding for linear systems The fact that so many 
well known non linear statistics go to normality in the limit as the 
sample size becomes large suggests that many of these limit 



F UNDA MENTA LS OF D YNA MI CAL THEOR Y 345 

theorems would actually be of Gaussian form. 47 It would be a 
task of some delicacy to establish conditions under which this must 
certainly be so. However, it is not to be thought that in most cases 
the limiting distributions must be of Gaussian form. In a moment 
I shall specify a non-linear system which approaches a limiting 
distribution not of Gaussian form ; in fact, an example will be given 
of one which remains bounded but approaches no stationary (prob- 
ability) state, but oscillates indefinitely in a simple periodic 
fashion. 

The key to a successful analysis of the difficult non-linear case 
lies in shifting our attack from the study of any one particular 
motion as it is bombarded by a particular set of random shocks to 
the analysis of the probability states corresponding to all possible 
shock patterns weighted in accordance with their likelihood. 
These contrasting approaches are to a certain extent similar to the 
contrast between the motion of a single molecule, the totality of 
motions of an ensemble of molecules as in statistical mechanics, and 
the kinetic theory of gases which describes macroscopic states of a 
system. However, the analogy is not perfectly complete and 
should not be pressed too far. 

First, let us assume that the initial conditions at time t are 
known. Then from our knowledge of the probability universe of 
Z t we may write down immediately the conditional probability 
description of Y t+ h given Y t , namely, 

P(Y t +i, F<) = H(Y t+1 , F,), (62) 

where the exact form of H can be easily specified as soon as / and 
the probability distribution of Z, are given. 

Now if we assume that the exact probability distribution of Y t 
is known, and equal to Pt(Y t ), it is possible to write down the 
distribution of Ft +1 in the form 

P (+ i(F/ + i) = f” H(Y I+1 , Y')P t (Y')dY, (63) 

v— co 

It is to be noted that even in the non-linear case the probabilities 
at different instants of time are related by a recursive linear 
functional. 

4 i Unlike most statistics computed from a sample, the F operations on the Z’s are 
definitely not symmetric functions. - 
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For concreteness we may show what form the H takes in the 
simplest case of a one variable first order system of the form 

y<+ 1 ~ ay i + Z t (64) 

with probability density for Z given by R(Z) Then corresponding 
to equation (63) we have the relation 

•Pi+itKi+i) = f * R(y i+> - ayi)Pibi)dy, (65) 

To see whether or not P t approaches a limiting distribution it 
is natural to put the same P in both sides of the above equation 
and solve the resulting equation for the form of the unknown dis 
tnbution This is an integral equation of the Fredholm type bnt 
with infinite limits Tor the special type of R given above and 
for a less than unity, the existence of a limit is assured 

For absolute a greater than unity we know that the variance in 
creases without limit, and that there cannot be a limiting form 
other than the trivial zero solution We may state this in another 
manner If we put a parameter X in front of the integral in (63) 
and ask for a function P which satisfies both sides, such will exist 
only if X is a characteristic value or eigen-value of the kernel II 
For undamped linear systems X *= 1 will not be a characteristic 
value It would be a task of some considerable mathematical 
difficulty to indicate just when the integral equation arising from 
a non linear system had an eigen value equal to unity and to derive 
the corresponding eigen function P(t) 

The problem would be simplified if we could assume that our 
non linear/ gives bounded values, whatever the value of preuous 

Y s This is obv lously possible only for non linear systems, earlier 

discussion of the physical limitations of full employment and zero 
income suggest that it is often realistic to assume barriers which 
guarantee this tact this assumption the absolute > tJvc of 

Y must be less than some number M, and we have — M and +M 
as the limits in the integrals instead of the infinite limits This 
removes the singularity in the Fredholm equation It is to be 
hoped that this continuous but bounded case will be fully anal} zed 
by mathematicians and economists 

I «ha!l confine myself to the simpler case w here h is a discrete 
rather than a continuous variable By making the classification 
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fine enough, we can reach any empirical degree of approximation 
to reality so there is no essential loss of generality involved. By 
replacing the above integrals by Stieltjes integrals, both cases can 
be handled simultaneously. However, in the simpler case under 
discussion, the integrals can be described by sums. But in this 
case F takes on only integral values, and the probability functions 
for Y and Z are at each time denumerable sequences of numbers. 
Corresponding to the kernel H we now have a matrix H whose 
properties depend upon the / function and the probability se- 
quence R(Z). 

However, to make the problem manageable there are other 
difficulties to be taken care of. Even in the simple linear case, 
with Z taking on integral values and the initial F’s taking on 
integral values, the F’s at a later stage will not be confined to 
integral values unless the coefficients of the integral equation are 
themselves integers. Let us satisfy this requirement in our simple 
linear example of equation (64) by setting a equal to unity. Our 
H matrix is then seen to consist of an infinite number of rows and 
columns, each column consisting of the R sequence of probabilities 
referring to the different Z values, with R(0) centered in the 
diagonal of the matrix as follows: 


• • -22(0) 

R(- 1) 

R(- 2)--- 

•••22(1) 

R( 0) 

£(-!)••* 

■■■R( 2) 

R( i) 

R(0) ••• 

■■■R(3) 

R( 2) 

R( 1) 


( 66 ) 


It is not an easy matter to work rigorously with infinite matrices 
and their latent roots and vectors. In this case we can be sure 
that there is a latent root of unity by virtue of the property that 
probabilities must add up to unity, but we cannot attach unam- 
biguous meaning to this, nor solve for the corresponding latent 
vector. 

It might be thought that the difficulty is of our own making 
in that the R sequence has been considered to be infinite. How- 
ever, the assumption of a finite number of terms in R only intro- 
duces zeros in each column after one reaches a certain distance 
from the diagonal. The matrix must still be considered to be 
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infinite in size and if we wait long enough Y can take on any 
integral value however large 

There is the further more serious difficulty that a damped sys 
tem with leading coefficient equal to unity cannot have coefficients 
which are all integers Clearly an equation whose roots are all less 
than unity in absolute value will have a product less than umtj 
and hence a coefficient which is not an integer 

Therefore even in the discrete case I shall make the non linear 
assumption that the difference equation which defines the dynamical 
path of the system is such as to yield bounded values of Y, It is 
convenient to assume that the T sin addition to being discrete take 
on only a finite number of values in this bounded interval 1 e we 
round off to some degree of accuracy Because we make our 
classifications as fine as we wish there is no serious loss of generality 
involved In this case the probabilities at time t of the different 
values of Y n in number can be represented by the equation 

Pi + 1 = HP, (67) 

where the P s are column matrices of n elements and the H is a 
square n by n matrix whose properties depend upon those of the 
non linear system and on the probability distribution of Z It 
will be noted that the sums of the columns of H are m every case 
unity This follows from the consideration that if Y is certainly in 
position i at a given time (so that the P vector has zeros everywhere 
except for a unity in the ith element) then it must certainly be 
somewhere one period later But its probability of being in each 
of the n positions one period later is under these conditions nothing 
but the tth column whose sum must therefore add up to unity 
Note also that all of the elements of H must be positive in virtue 
of their interpretation as conditional probabilities 

Because the columns add up to unity we place a minus one in 
each diagonal and add each row to the first row The result is a 
singular matrix with zeros in the first row This shows that unity 
is a latent root Similarly the fact that all elements are positive 
and add up to unity guarantees that there are no latent roots which 
exceed unity in absolute value For if there were we could select 
initial conditions for which a given element in P would grow ex 
ponentially without limit This contradicts the assumption that 
(probability) element of P can exceed unity 
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In most cases there will be one root of unity and all other roots 
less than unity in absolute value. Thus, whatever the original 
probability distribution of Y, it will gradually approach a station- 
ary probability state given by the latent vector of H corresponding 
to the unit latent root. It will satisfy the equations 

HP = P, (68) 

where the P is normalized so that its elements (and not their 
squares) add up to unity. The exact values of the P’s can be de- 
rived by solving (n — 1) linear equations. It will be noted that 
the solution secured will have nothing to do with the limiting 
Gaussian form. 

In special cases the unit latent root may not be simple so that 
no unique stationary probability distribution is reached. In still 
other cases there may be another latent root whose absolute value 
is equal to unity, and which is either complex or equal to minus one. 
In either case for “most” initial probability states there will be no 
approach to a stationary probability state; instead there will be 
periodic oscillation. A simple example is provided by a 2 by 2 H 
matrix of the form 

[? J]- (69 > 

If we start out with a probability vector (a, b ), it gives rise, because 
of the negative latent root, to the oscillatory sequence ( b , a), (a, b), 
(b, a), etc., without ever approaching a limit. 43 

48 See W. Feller, An Introduction to Mathematical Probability and Its Applications (New 
York; Wiley, 1950) for discussion of Markoff processes. 
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CONCLUSION 

Economics is a growing subject in which very much is left to be 
done It is only appropriate, therefore, in bringing this work to a 
close, to indicate a few of the important unsolved problems crying 
out for further investigation 

In the first and second chapters I outlined the general problem 
of comparative statics how from a knowledge of the qualitative 
and quantitative properties of our equilibrium conditions we can 
hope to deduce meaningful theorems concerning the direction and 
magnitude of changes in our variables when certain data change 
In chapter in it was shown that in a large class of cases the econo 
mist derives definite theorems by means of the hypothesis that the 
equilibrium position represents a maximum or minimum position 
The inequalities associated with the definition of an extremum 
position were seen to be the source of fruitful theorems in compara- 
tive statics 

Chapter iv represented an application of this analysis to the 
cost and production theory of the firm, just as chapter v gave a 
treatment of constrained maxima as required by the theory of con 
sumer’s behavior Special aspects of the latter subject received 
treatment in chapters vi and vn And finally the study of static 
maxima and minima was brought to completion m the analysis of 
welfare economics given in chapter vm 

In the first chapter of Part II it was shown that simply from 
the standpoint of fruitful comparative statics dynamic analysis is 
useful and necessary Indeed, the correspondence principle, enun 
ciating the relationship between the stability conditions of dy- 
namics and t'he evaluation o't displacements m comparative statics, 
provides the second great weapon in the arsenal of the economist 
interested m deriving definite, meaningful theorems 

In chapter x dynamical systems are studied for their own sake, 
particularly in their stability aspects while in chapter xi I have 
discussed various fundamentals of dynamical analysis including 
f 1 problems arising out of the study of business cycles 
350 
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Broadly speaking, the development of analytical economics has 
proceeded in a natural evolutionary order. First, in Walras we 
have the final culmination of the notion of determinateness of equi- 
librium on the statical level. This has received further elaboration 
in the hands of Pareto and others. 

However, Pareto took a second, further step. He laid the basis 
for a theory of comparative statics by showing how a change in a 
datum would displace the position of equilibrium. Even earlier 
Cournot had pioneered in this “infinitesimal” analysis, although 
with reference to a narrower set of problems. 

While Pareto laid the basis- for comparative statics, his own work 
was not rich in definite theorems in this subject, precisely because 
he rarely concerned himself with the secondary inequalities relevant 
to maximum positions. On the few occasions when he did do so, 
he came to grief because of mathematical errors in their statement. 
It was left for W. E. Johnson, Slutsky, Hicks and Allen, Georgescu- 
Roegen, Hotelling, and other modem writers to begin to make 
progress along this third line. 

However, only a part of economic theory is concerned with the 
maximizing action within an economic unit. Where the inter- 
actions between individuals are concerned, the scope of fruitful 
comparative statics may be greatly extended by a fourth advance, 
the apprehension of the correspondence principle, whereby the com- 
parative statical behavior of a system is seen to be closely related 
to its dynamical stability properties. 

A natural fifth step to take after we have investigated the 
response of a system to change in given parameters is to investigate 
its behavior as a result of the passage of time. Thus, we cultivate 
dynamics for its own sake, especially with respect to the qualitative 
properties of the respective motions. 

The usefulness of any theoretical structure lies in the light 
which it throws upon the way economic variables will change when 
there is a change in some datum or parameter. This commonplace 
holds as well in the realm of dynamics as in statics. It is a logical 
next step, therefore, to begin to create a theory of comparative 
dynamics. This will include the theory of comparative statics as 
a special case, and indeed all of the earlier five subjects, but it will 
cover a much richer terrain. 

The central notion of comparative dynamics is simple enough* 
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We change something (just what need not concern us at the mo 
ment) and we investigate the effect of this change on the whole 
motion or behavior over time of the economic system under in 
vestigation It will be seen that comparative statics involves the 
special case where a 4 permanent ' change is made, and only the 
effects upon final levels of stationary equilibrium are in question 

In comparative dynamics we consider a much broader class of 
changes (a) We may make a change in initial conditions By 
definition this alters the immediate behavior of the system in a 
known way By the assumption of continuity we may infer that 
the position of the system for some region adjacent to the initial 
conditions is also altered in the same direction For intermediate 
lapses of time a separate investigation is necessary to determine 
what happens to the system However for a stable system it is 
clear by virtue of the definition of stability that for sufficiently 
long time periods there will be no final alteration in the behavior 
of the system 

(b) We may make a change in some force acting on the system 
Thus we may cause autonomous investment to vary Actually, 
there are a variety of cases which must be considered The change 
in force may be permanent, it may be intermittent, it may be 
transient or instantaneous In this very last case the analysis may 
be subsumed under the heading of a shift in initial conditions In 
the case of stable systems the response to a permanent alteration 
gives us a description of the actual path followed by a system in 
going from one 4 comparative static level’' to another 

For linear dynamic systems but only for these, the most general 
of the above variations can be thought of as made up of the cumu- 
lated effect of unit impulses, or of changes in the instantaneous 
initial conditions of the system This follows from the basic 
superposition theorem which underlies much of applied mathe 
matical analysis 

(c) Finally, there may be a change in some internal parameter 
of the system We may ask, for example, what the effect of a 
cha ige in the marginal propensity to consume or in the “relation" 
may have on the behavior of a system Again, the change in 
question may be permanent, varying transient, etc 

The nch variety of forms which the change in data may take 
matched by the numerous ways in which we can choose to de 
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scribe the “resulting effects on the behavior of the system.” Ex- 
cept in the most simple cases we shall find it necessary to summarize 
in various ways the information contained in the resulting changes 
in the system at each and every instant of time. 

From a short run point of view interest will center on the im- 
mediate reaction on the system. The answer to this question can 
often be secured by a method which is formally like that of com- 
parative statics with the significant difference that some of the 
variables treated are really dynamic in character. 

This may be illustrated by the important case in the Keynesian 
analysis where investment is taken to be a variable whose value is 
to be determined by a system of relations like those discussed at 
the end of chapter ix. For the usual Keynesian theory, in the 
short run at least, the stock of capital is taken to be constant. 
By “solving” such a system we can finally reduce our relations 
down to a single equation between the amount of investment, J, 
and the stock of capital, K, and the value of some parameter, a. 
This parameter might well be the amount of thriftiness in the 
system. Now if we are interested in the effect upon the stock of 
capital in the immediate future of a change in this parameter, the 
answer can be given by treating investment as if it were an ordinary 
statical variable and solving, as in chapter ii, our equilibrium equa- 
tions to determine the direction of change of investment with 
respect to the parameter, a. 

However, to answer our question in comparative dynamics we 
must introduce the fact that investment, which we have previously 
been treating as an ordinary statical variable, is really the rate of 
change of the stock of capital, or equal to dK/dt. If the compara- 
tive statical analysis tells us that investment is reduced by an in- 
crease in thriftiness, we can be sure that in some sufficiently short 
run the amount of capital in existence will be less than it would 
otherwise have been. For if two curves start out from the same 
point with differing rates of increase, we can be sure that the one 
with the greatest rate of increase will exceed the other for some 
small region at least. 

As it happens, a similar statement can be given in this case to 
the wider problem as to what happens to capital in the long rim as 
a result of a change in thriftiness. Thus, if instead of simply asking 
what level of consumption maximizes current investment, we 
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widen Professor Lange s question 1 and seek the levels of consump 
tion leading to the most capital at each instant of time we shall find 
that capital formation m a run of any length is only maximized if 
at each instant the Lange criteria are met 

However when the problems of comparative dynamics are 
posed in general and realistic form it turns out that the above 
identity results almost by chance Thus consider a system of tw o 
or n dynamical equations 

“ 6 (*i *. “) (« ~ 1 n) (1) 

Suppose furthermore that an increase in a alvvaj s increases dxjdt 
or that g a l is always positive Does it follow that an increase in a 
always results in a higher Xi at each subsequent instant of time? 
The answer is no For some sufficiently small instant of time 
starting out from given initial conditions this must of course be 
true but it need not continue to be true In the one variable case 
to which I reduced the Keynes Lange system the stronger theorem 
is true For m this case we may solve explicitly our differential 
equation in the form 

‘-•TiF *- 0 (2> 

and by partial differentiation of the above implicit relation between 
t x x and a it is easy to show that the change in Xi with respect to a 
t being fixed must be of the same sign as the definitely known 
coefficient g a l The reader can work out the geometrical reason 
for this 

Not only is the direct solution of the system impossible m the 
multi vanable case but the corresponding theorem is definitely 
untrue Economically this is not hard to visualize If the param 
u ka& a pit mucmced effect upt/ri the gicrwtVi of a. ‘ts&xnA tosi 
able after a sufficiently long period of time this indirect influence 
may outweigh the direct favorable influence on the first variable 
The reader may wish to w ork out an even more complicated model 
than the above in which a enters as a dynamic parameter Thus 
it is not difficult to construct a model in which deficit financing has 

1 0 Lange The Rate of Interest and the Opt mum Propens ty to Consume 
ca V (1938) 12 32 
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a favorable effect upon the growth of capital in the short run, but 
the accumulated bad effects of the growing debt are adverse to 
capital growth. I do not wish to pass judgment upon the reality 
of the above assumptions, but simply wish to point out the possible 
occurrence in economic systems of the common medical phenomena 
whereby short term remedies may have long term deleterious 
effects. 

Of course, if we confine our attention to the behavior of long 
period stationary equilibrium positions, the methods of compara- 
tive statics come into their own again. All time derivatives, differ- 
ences, etc., are set equal to zero and the resulting system solved 
like any statical system of chapter ii. 

Finally, the economist will often be interested in the effect upon 
some characteristic of the motion of the system. How will a 
change in the “relation” affect the average level of a fluctuating 
trendless system, its periodicity, its dampening, and its amplitude? 
Nor need we confine ourselves to simple stable systems. Thus, it 
is of the greatest interest to know how a change in age-specific 
mortality will affect the net-reproductive-rate; or how the period 
of delay in the expenditure of income will affect the rate of inflation 
under a given defined inflationary gap. 

The further development of analytical economics along the lines 
of comparative dynamics must rest with the future. It is to be 
hoped that it will aid in the attack upon diverse problems — from 
the trivial behavior of a single small commodity, to the fluctuations 
of important components of the business cycle, and even to the 
majestic problems of economic development. 



MATHEMATICAL APPENDIX A 


Many mathematical theorems of importance to the economist 
are not to be found collected in any one convenient place. With- 
out striving for rigor, I have gathered together some of those most 
relevant to the matters under discussion in the present work. 


I. Maximum Conditions 
Let 


for a Single-Variable Function 

2 = f ( i ) 


be a defined function with a continuous second derivative every- 
where on an interval 


z will enjoy a relative maximum at the point t°, providing 

M ==/(*»), a < t° < b (1) 

for neighboring values of t. 

It is necessary in order that this condition be satisfied that 

fin = o, ( 2 ) 

and 

fit 0 ) =i 0. (3) 

The first of these conditions is derived in the following manner. 
By the Theorem of the Mean, 

fit) - f{t°) = {t- t°)f[t° -f S{t - /»)]. 0 < 6 < 1 

Suppose 

fit 0 ) > 0. 

Then from our continuity assumptions there exists an interval 

j / — t° | < m 

within which the following inequality holds everywhere 

fit) > 0. 

Hence, for all such neighboring values of t larger than t°, we have 

m - m > o, 
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which contradicts our hypothesis. Similarly, 
f(t°) < 0 

would lead to a contradiction. Therefore, 

fin - o. 

The second necessary condition is derived in a similar manner. 
From Taylor's expansion with a remainder we have 

m - m = (< - (”)/«") + (l ~ 2 + nt - m 


nn = 


But 


and so this becomes 

m - m - (i + *'(< - <•)]■ 

Suppose 

/"(<°) > 0. 

Then by assumed continuity 

no > o 


0 < 0 ’ < 1 


0 < 0 ’ < 1 


l* - P\ < m'. 


on an interval 
In consequence, 

m - m > o 

on this interval. But this is a contradiction. Therefore, 

/"(t°) 0. 

Actually, it can be shown under conditions of proper continuity 
that the first non-vanishing derivative must be even and negative 
i bra reihtiVe maximum. 

I define a regular maximum as one for which the equality sign 
in equation (3) is excluded. It is necessary and sufficient in order 
that z enjoy a regular maximum at t° that 

nn = o, 

and 

< 0. (4) 
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Similarly, necessary conditions for a relative minimum under 
our assumed conditions are 

/'(*°) = 0, 

and 

^ 0 . ( 5 ) 

Necessary and sufficient conditions for a regular minimum are 

f(t°) = 0, 

and 

/"(*°) > 0 . ( 6 ) 


II. Maximum Conditions for a Function of 
Many Variables 
Let 

Z = fix 1, • • • , x n ) 

be a defined function with continuous second order partial deriva- 
tives of all kinds in an open region S'. Then a point (X°) in S 
affords a relative maximum to z provided that 

f(x i, •••,£„) ^ f(x i°, • • • , x n ° ) 

for sufficiently close values of ( X ). 

It is necessary in order for this to be true that 

df{x 1°, • • • , X n °) = n 
dX i 

df(xi°, • • • , x n °) _ 
dx 2 


df(x 1 °, • • • , Xn°) = 

dx n 


= 0 


and 

+ d2f - 


dxidxi 


+ 


dx x dx 2 

dj 

dx 2 dx 2 


hi 2 +2 


dxidxz 

dj 

dx 2 dx z 


h x h z + • • • + 2 

h 2 hz + • • • + 2 


+ 


+ 


dXidXn 

d 2 f 

dx 2 dx n 

dj 

dXndXn 


hiK 

h 2 h n 


h 2 

ff'Tl 1 
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where (ft i, ftj, , ft») are arbitrary numbers, and the partial de 
nvatnes are e\aluated at the point (A'°) 

If we adopt the following notation 


/.• - /.°(*i°, 

and 


, . a’-ff*.*. . *.“) 

dXidXj 


(* - 1, , n) 


then our necessary conditions can be written symbolically 

/.° = 0, (* - 1. , «) (7) 

and 


£ £/.,"«, s o 

i i 


( 8 ) 


It is necessary and sufficient for a regular relative maximum that 


and 


/<•- 0 , <« = ], ,») 


£ 2/«**A < 0 (9) 

1 l 

for not all ft’s equal to zero In addition to the vanishing of all 
first partial derivatives a regular relative maximum requires that 
the quadratic form whose coefficients are the second partial deriva 
tives be negative definite This last condition is the generalization 
to many variables of the single variable condition that the second 
derivative be negative for a regular relative simple maximum Its 
meaning will be amplified m a later section 

The proof of these necessary conditions can be easily demon 
strated by the use of a device which enables us to convert our 
problem into that of a single variable one upon which we can 
bring to bear the results of the previous section 
Let 

x, = xf + th { , (i — 1 , , n) 

where the ft’s take on arbitrary values Clearly for a relatn e 
maximum we must ha\e 


fix 1® + th u X , 0 -f th t 


,x H ° + th o m/(x 1°, 
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For given values of the h’s, not all equal to zero, / can be regarded 
as a function of the single variable t, and the above condition can 
be written 

m ^ m. 

From the previous section we know that this requires that 

no) = 0, 

and 

/"( 0) =£ 0. 

But, by performing the indicated differentiation, we find 

no) = £ fi°hi = 0, (10) 

1 

and 

/"( 0) = 0. 

1 1 

In order that (10) be satisfied for arbitrary values of the h’s, each 
partial derivative must vanish, or 

fi° — 0- (* = 1, • * •, n) 

For a regular relative maximum, we get by this method 
fi° = 0, (i = 1, ■■■, n) 

and 

Yj Y fn°hihj < 0, not all h’s = 0, 

1 X 

these conditions being both necessary and sufficient. 

Similarly, a regular relative minimum requires that 

fi° = 0, (i = l, •••,») 

and 


Y Y fifhihj > 0, not all h’s = 0. (11) 

1 1 

The appropriate quadratic form must be in this last case positive 
definite. 
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III Conditions for a Constrained Maximum 
As before, let 

Z = /(*. , X.) 

be a defined function with continuous second partial derivatives in 
an open region S In addition, we impose subsidiary conditions 
on the admissible values of our variables in the form of the following 
m implicit functions of the n variables, defined in 3 1 with continuous 
second partial derivatives, 

G'(xi, , x n ) = 0, 

<?(*,, x n ) » 0, 

(m < n) 

G m (x 1 , , Xn) = 0 

Under what conditions will a point (X°) m S afford a relative 
maximum to z, and satisfy the subsidiary conditions' 1 Such a point 
is defined as follows 

/(*i, ,**)S/(a :i # . .*„•>, 

where 

G k (x h , x n ) = G k (x A , x n °) =0, (k = 1, , m) 

and (.30 sufficiently close to (X°) 

The way to solve this problem which would suggest itself most 
naturally would be to use our m subsidiary conditions to eliminate 
m of our variables by expressing them in terms of the remaining 
(n — m) variables We assume that the rank of the matrix 


or CGv‘3 (ft *= 1, ,m) 

(i = 1, , n) 


is of order m so that at least one m 2 determinant does not vanish 


d& 

d& 

dG 1 

dxi 

dx 2 

dx n 

dG 2 


dG 2 

dxi 

dx 2 

dx n 

dG” 

dG m 

dG • 


dx 2 dx n 
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This insures us that we can in fact uniquely solve our subsidiary 
conditions for m variables in terms of the remaining (n — m). 
Since the numbering of our variables is arbitrary, we may write 
this without loss of generality as 


1, Xm4-2i ", X n ). (t — 1, '**, 

By substitution in our original expression, z can now be expressed 
in terms of the variables ( x m+ i , x m+2 , • • • , x„) with no conditions of 
constraint, and the analysis of the previous section can be applied. 

However, there is a great loss of symmetry in such a procedure 
since not all our variables are treated alike. Fortunately, by the 
use of an artifice which can be rigorously justified, it is possible to 
derive a more symmetrical set of conditions. 

Consider the Lagrangean-multiplier expression 

H = H(x i, •■■,x n ) = f(x i, • • • , x n ) 

+ XjGKSl, X m G m (x 1, • • • , X„), 


where the X’s are constants, not all zero, whose values will be de- 
termined later. Obviously for values of (X) satisfying the sub- 
sidiary conditions indicated above, all terms after the first must 
vanish, making our new expression equivalent to z. This suggests 
that for a maximum at (X°) 


dH° df° dG 1 " Q 

HxTi = Hi = ^ + Xl ^7 + ‘ +Xct dxi ~ 0, 


(* = !,•••,») ( 12 ) 


subject to 


G k (xi°, • • • , jc„°) = 0. (k = 1 , 


It is unnecessary for our purposes to indicate explicitly the rigoro”s 
proof of this statement. 1 

The last two sets together constitute ( n + m) equations 
in the {n + m) unknowns (xi, • • •, x„, Xi, • • •, X m ). Under the 
proper restrictions the values of all the x’s and X’s are uniquely 
determined. 


1 See CarathfSodory, Varialionsrechnung und Parlielle Differentialgleichungen Erster 
Ordnung (Leipzig und Berlin, 1935), pp. 364-189. The conditions of (12) are identical 
to those of (13) which follow. Carat he odory shows that if some minor of (13) greater 
than t n were non-vanishing, then one could — by the implicit function theorem solve 
for some x in terms of /, where / is both greater and less than the extremum value, /°. 
This is a contradiction. 
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Hit above first necessary conditions cm be expressed 


pendently 

of the X s 

Consider the matrix 

df 

aG 1 

aG* 

aG" 


dxi 

a^i 

dXi 



df 

a# 

dGf 

aG" 


dx< 

dxt 

dXz 

a** 

ra/ aGH 





’ or [ax, ■ dx, J 

df 


dGf 

aG" 


dX n 

dx n 

dX n 




mde 


n) 


The rank of this matrix must be equal tow ie, every ( m + 1)’ 
determinant must vanish Thus 


2L *21 n (* = 4 m + 1) 

dx t dx t “ (k ** 1 , m) 


(13) 


and so forth for all possible combinations of (n) variables taken 
(m + 1) at a time These will yield (n — in) independent condi 
tions on the partial derivatives of / and the G s 

For concreteness consider the case of a many variable function 
to be maximized as of one subsidiary condition of restraint Ac 
cording to the conditions indicated above there will result at an 
extremum tmgency between the locus of restraint and a contour 
locus of z (n — 1) conditions familiar in many branches of eco 
nomic theory 

In addition to the first necessary conditions upon first partial 
denvatives there are as m the previous sections certain added 
secondary conditions on the second partial derivatives The fol 
lowing must hold 


E £ll,°hh t 

i i 


« £ £ 


' ay* 

v dx dx. 


+ X 


W 

1 dXidXj + 


+ Xm 


d*G* \ 
OxdxJ 


h hj < 0 


(14) 


k~h.-0 (*- 1 ,m) 


for 
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In summary, it is necessary and sufficient for a regular relative 
constrained maximum that 

Hi 0 = 0, (i = 1, •••,«) 

and 

E E Hif'hihj < 0 

1 1 

for 

» dG k " 

E ^7 hi = 0, not all /ds = 0. 

The meaning of these secondary conditions requiring the defi- 
niteness of certain quadratic forms under linear restrictions will be 
dealt with later. 


IV. Quadratic Forms 

A real (homogeneous) quadratic form in n variables (Ji u • • •, h n ) 
is defined as follows: 

Q(h 1, • • •, h n ) — (iiihi 4 * a-nhih-y +•••-{- CLi n hih n + 
a-nhdli T fl.22^2 2 T ' ' ■ T Q-2nhihn 4 " 

( 16 ) 


d n ih n hi 4 - cLn7.hn.h2 4 " * • * 4 * 

where the a’s are real constants. A quadratic form in the single 
variable x would take the form 


Q(x) = ax 2 ; 

in two variables (x, y) it would be written 

ax 2 4 - 2b xy 4 - cy 2 . 

Since the order of multiplication is indifferent, it is clear that 
any quadratic form can be made symmetrical, i.e., 

a i} — aji; a = a’. ( 17 ) 

For if not already symmetrical, we may rewrite it without affecting 
its value as 
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which is symmetrical Unless otherwise specified we shall deal 
only with symmetrical quadratic forms 

A positive definite quadratic form is one which is always positive 
e\cept when all the variables equal zero (at which point all quad 
ratic forms vanish) A negative definite quadratic form is one 
which is always negative except at the origin A definite form is 
positive definite or negative definite A semi-definite form is one 
which never changes sign but may be zero at a set of points other 
than the origin Thus a positive semi definite form could more 
appropriately be called a non negative form All forms which are 
neither definite nor semi definite are called indefinite 

A quadratic form in n variables is singular if the rank of its 
matrix 2 

« - M 

is less than »,ie if the determinant 

l«ul = 0 

It may be observed that all semi-definite forms are singular 

Forms m less than three variables 

In the case of a single variable, it is easy to determine the 
definiteness of 

Q(x) = ax 2 

The form is (1) positive definite, (2) semi definite and singular, or 
(3) negative definite depending upon whether 
a § 0 

It is more difficult to ascertain whether a form in two variables 
is positive definite If 

Q = ox 2 + 2 bxy + cy" 1 

is to be always positive except at the origin (0, 0), it must clearly 
be positive for (1, 0) But 

(3(1, 0) = o, (18) 

and so a necessary condition is 

a > 0 (19) 

* A matrix 13 of rank r if atl its determinants of order (r + I ) vanish All detemu 
nants of a still higher order will be found also to vanish 
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Similarly, it can be proved that 

c > 0. (20) 
These conditions are necessary but not sufficient. To derive 
still another necessary condition and determine complete sufficient 
conditions, let us consider the form as a quadratic equation in x 
with parameter y, 


fix) = ax' 2 + (2 by)x + icy 2 ) = 0. 

If the form is positive definite, # must not vanish for any non- 
vanishing y. Solving by the familiar rule for extracting the roots 
of a quadratic, we have 


_ _ 2 by Az fAb 2 y 2 — 4 acy 2 _ f — b ± fb 2 — 
2 a a 


But this must yield no real value of x or the form would not be 
positive definite. Hence, the expression under the radical must be 
negative, and we have as an additional necessary condition 


ac — b 2 > 0. 


( 21 ) 


However, our three necessary conditions (19), (20), and (21) are 
not all independent. The last of them in conjunction with either 
of the first two implies all three. The sufficiency of these condi- 
tions is implied by the fact that the form is positive for the values 
(1,0), zero only for (0, 0), and never changes sign; consequently, 
it is positive definite. 

These conditions may be given a more convenient form. The 
matrix of the form in two variables is obviously 



If we define a principal minor of a determinant as a subdeterminant 
formed by crossing out the same rows and columns so that the 
diagonal elements of the minor are contained in the diagonal ele- 
ments of the original matrices, then our necessary and sufficient 
conditions are implied in the statement: all principal minors must 
be positive. These are three in number, but not being all inde- 
pendent, a complete set may be specified by making successive 
principal minors positive starting with the upper left-hand comer 
and adding successively corresponding rows and columns. (In this 
case there is only one row and one column to add.) 
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I or the form to be negative definite it could be shown along 
similar lines that 


a < 0 
la b I 
U c\ 


> 0, 


( 22 ) 


or that the first principal minor is negative, and the successive ones 
alternate m sign those of odd order being negative, and those of 
even order being positive This is because [ a is negative definite 
only if [ — a is positive definite 


Torms in any number 0 } variables 

The special methods applicable to forms of few variables cannot 
be applied in the general case We must therefore seek more 
general methods A non singular transformation of variables of 
the type 

>*. “ £ b„pi, (r * 1 , n) 

1 

or using matm notation, 

h — bp, p = b~ l h, 

leaves the quadratic form positive definite if originally positive 
definite This is made plausible from the recognition that the 
origin goes into the origin, and that every point in the h space is 
uniquely related to every point in the P space Consequently, Q 
after the transformation takes on only the \ alucs taken on before 
Q{h) = Q{bp) - Q(p) 

In particular let the transformation b be such as to take Q into 
a sum of squares of the form* 

Q(bp) = &xpi* + d,pS + + (23) 

It is well known that an infinity of transformations exist which will 
do this 


1 In matrix notation 


b must be chosen so that 
Y&i 0 


Q = hah 

* P ( bab)p where h «= bp 


= w<il 


_ f 1 for t - J 1 

l 0 for t 9 * j J 
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Our problem is for the moment reduced to the simpler one of 
determining the definiteness of a quadratic form which is a sum of 
squares. Clearly this is positive definite, if and only if 

> 0. (i=l, •••,«) (24) 

By the famous Sylvester's Law of Inertia it is known that the num- 
ber of plus and minus coefficients in the a’s is independent of the 
transformation employed. Clearly, therefore, the a’s, if all posi- 
tive after a particular transformation, are all positive for any 
possible transformation. 


Transformation into a sum of squares 

Two out of the infinity of possible transformations are of par- 
ticular interest since they show definitely what conditions upon 
the matrix a are implied by its being positive definite. First, there 
exists a transformation b — gotten by an extension of the method 
of “completing the square” — such that 
£1 = flu 




^3 — 


Oil 

012 


021 

022 

2 

Oil 

~ * 

Oil 

012 

Ol3 

021 

022 

O 23 

Oal 

0 32 

O33 


On 

O21 


O12 

0-22 


A 3 

A 2 


(25) 



On O 12 

O 21 O 22 

On 1 O n 2 

• - ’ Oi„ 

O 2 n 

■ ' * O n n 

A n 

Oil 

O 12 

O'l.n—l 

T 

c 

1 

021 

O 22 

02,n-l 


On— 1,1 O n — 1,2 

O n - 

-l,n— 1 
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It is necessary and sufficient, therefore, that these ratios are all 
positive 4 But 

A 1 = an = Q( 1, 0, , 0) > 0 

Hence, a necessary and sufficient condition that a quadratic form 
be positive definite is that the principal minors (A 1 , , A” ) must 
be positive 

Minimum properties of Q 

Some preliminary observations are in order before discussing 
explicitly the second transformation Because a quadratic form 
is homogeneous of the second degree in the V s if its sign is always 
positive on the unit hypersphere defined by 

hi* + W+ -Hi. 1 « 1, (26) 

it is positive everywhere except at the origin 6 There is no loss 
in generality therefore, in considering the sign of Q subject to the 
side condition (26) 

Since Q is everywhere continuous with continuous derivatives 
it necessarily attains a minimum on the closed hypersphere For 
the form to be positive it is necessary and sufficient that the minimum 
of Q on the unit hypersphere be positive „ because then, and only then, 
is it positive everywhere except at the origin 

This minimum can be calculated explicitly Consider 

Q = E E a»h Jij 

1 1 

* Cf H Hancock Theory of Maxima and Minima pp 89-91 
‘This is to be seen as follows If Q were negative for any value of A(“> h) not 
necessarily on the unit hypersphere it would be negative for some value on the hyper 
sphere Suppose 

21 X 1 ■» r* 

Then the point (hi I t , h*/r) is on the unit hypersphere and 


so that their signs agree 



subject to 
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£ ^ -1 = 0. 
x 

Form the expression 

H = £ £ ciijhihj - - 1 ). 

11 1 

According to the previous section it is necessary for a minimum that 

2 £ dijhj 2\hi = 0, (f — 1, • • • , 7i) 

or 

£ («<i ~ = 0. (t = 1, • • •, ») (27) 

1 

These are linear homogeneous equations which possess a non- 
vanishing solution if, and only if, the determinant 

D(\) = \a t} - X8,-y | = 0. G (28) 

This is an »th degree equation in X and will possess n roots, some 
of which may be repeated. For simplicity, we assume them to be 
distinct. Let us indicate these by 

Xi < X 2 < • • • <C X„, 

At least one must be real since a real minimum does exist. Ac- 
tually, by a well-known theorem of Hermitian matrices of which 
our symmetrical real matrix is a special case, all are necessarily 
real. 7 Corresponding to any X*, there exists a unique solution 
(hi k , • • • , h n k ) satisfying equations (27) and lying on the unit hyper- 
sphere defined by (26). 

Also, multiplying the first equation of (27) _by h k , the second 
by hi k , etc., and adding we find 

X* = £ £ ajit-hf- = Q{h k ). (A - 1 ,•■■,») (29) 

1 1 

It follows that the smallest of the X’s, Xi, equals the minimum of 

5 This is called a secular equation, a delerminantal equation, and the characteristic 
equation of the matrix a. Its roots are the latent roots of the matrix a. 

7 H. Hancock, Theory of Maxima and Minima, pp. 107-109. See also the discussion 
on orthogonality in footnote 9 below. 
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the quadratic form Hence, a necessary and sufficient condition that 
a quadratic form be positive definite is that all the latent roots of its 
matrix be positive 8 

Similarly a form is positive semi definite if none of the X’s is 
negative, although one or more is zero A form is negative definite 
if, and only if, all the \’s are negative 

Second transformation into a sum of squares 

This can be proved in still another way Consider the trans 
formation 

pk=£h*h, (k = 1, 

1 

It is orthogonal, 1 e , 9 

ip: = in:, 

1 1 

i h*hr = h kn 

1 

•licanbe shown that X t is the minimum of Q o \ the set of points lying on the unit 
hypersphere and satisfying the constraint defned !>' 

hi*hi + hfh + + A, A, = 0 

If we adjoin still another constraint 

hfh, + hfh, + + A.’A. - 0 

Xi minimizes Q Generally X, minimizes Q subject to 

£*•- 1 

i h ih - 0 0-1 ' - 1) 

* In matrix notation 

Orthogonality is defined by 

bb = / therefore b = b 1 
Therefore p •= bk impi es h ** b p 

To show that any two different h columns are orthogonal note that 
ah* ** Xi I* ah> ■= \jh 

and that 

0 = (*') ak* - (k*)'ahi «= (X* - X )(A*) h 1 

If the two roots are unequal the very last product musts anisli and the/i s are orthogonal 
Because two vectors consisting of conjugate complex numbers cannot be orthogonal 
this shows the truth of the previous assert on that all roots of a symmetrical matrix 
must be real 


, n) (30) 

(51) 
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' hi = £ hi k p k , (i = 1 , • • •, n) 


(32) 


and 


n n n it 


Q(P) = I E a-ij E E hppjif'p,, 

ii ii 


= E E (E a ij h i k hj n )pkP m '° 

i i i 


= E 
1 


(33) 


This shows that the original quadratic form can be transformed 
into a sum of squares whose coefficients are the latent roots of the 
matrix a. This furnishes a second proof that the \’s must all be 
positive if the quadratic form is positive definite, and conversely . u 


Characteristic equation of a matrix 

Let us consider in more detail the wth degree determinantal 
equation 

D(X) = | an - X5,vi = (Xi - X)(X 2 - X) • • • (X„ - X). (34) 

Expanding out the determinant, this becomes 

(— X)’ 1 + Ci(— X)' 1-1 + • • • + c„_i(— X) + c n = 0, (35) 

10 Proof that the expression in parentheses above equals \k is derived by multiplying 
the tth equation of (27) by hi m and summing. This gives 

n n n 

£ £ a H hfhi m = Xfc X) ht k hi m = \kfom 

i i i 

because of the orthogonality property of the transformation b. 

11 The above discussion implicitly assumes that all latent roots are distinct. If they 
are not. the matrix a can still, because of its symmetry and linear divisors, be transformed 
into a diagonal matrix whose elements are the latent roots, but not by a unique orthogonal 
transformation. 
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where 
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ci — an + fljs + + c„„ = sum of all it principal minors 

of the first order 


Ci 


On On ^ On a n 
On Ojj Oil On 


sum of all n(« — l)/2 
principal minors of the 
second order. 


(36) 


c r “ sum of all w'/r'fw ~ r ) f principal minors of the rth 
order 


C„ *= |o„| = D( 0) 

From the familiar relationship between the roots and coefficients of 
a polynomial we have 

Ci — Xi + Xj + 4~ X« = on + O 11 + + 0»n 

Ct = XiX* + XiXj + 4- XiX. 

4- XiX| + + X*Xn 

+ + X._ t X. 


(37) 


c t - sum of all possible products of (Xi, , Xj taken r 
at a time 


c.-M-D( 0) =X!X, X b >0 

Descartes’ rule of signs requires that all the c's be positive if 
all the X's are to be positive, and conversely It is clearly sufficient 
that all principal minors be positive 

It is obvious that definiteness of the quadratic form Q implies 
definiteness of all quadratic forms made up of subsets of the van 
ables, for these are simply special cases when some of the h’s are 
set equal to zero The latent roots of the matrices of all such 
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subsets must be positive. From a consideration of the minimum 
properties of the X’s given in footnote 8, it follows that any such 
latent root X* satisfies the relation 

Xi SI X* X n . (38) 

From this fact we have another proof of the necessity that all 
principal minors must be positive since the product of the latent 
roots of a submatrix equals a corresponding principal minor. 

Definiteness of inverse matrix 

Repeatedly in economic problems it is important to be able to 
evaluate the principal minors of the reciprocal or inverse matrix 
ar 1 when a is known to be definite. 12 The following theorem is, 
therefore, very useful. If a is the matrix of a positive definite form, 
then ar 1 is the matrix of a positive definite form. Let 

Q = h'ah. 

Perform the transformation 

h = a-'p, (39) 

where a = a' because of the symmetry of the matrix a. Then 

Q — p' a~~ v aa~ l p = p'a~ v Ip ------ p'a~ l p. (40) 

This shows that the original quadratic form can be transformed 
into one whose matrix is the inverse of the original matrix. From 
our previous discussion this must be definite. 13 

13 

aT 1 = 


where A is the determinant of the matrix a, and An is the cofactor of the element in the 
ith row and jth column of the determinant A = |a,-,-| • a -1 is termed the inverse of a 
because the transformation a -1 following the transformation a is equivalent to the 
identity transformation I because of the property 

aar 1 = a~ l a = I. 

Only non-singular matrices have inverses. 

1S Sylvester’s Law of Latency states a more general relationship. The latent roots 
of /(a) are equal to /(X) where the X’s are the latent roots of o. This follows from the 
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V. Quadratic Forms Dehnitl under Constraints 
Let us now derive necessary and sufficient conditions that a 
quadratic form 

Q = E Z a>Xk } 

i i 


is positive definite for values of h satisfying m( < n ) linear condi- 
tions of constraint 

£gfh, = 0, (a = 1 , • • • , m) ( 41 ) 

i 


where |gj a | is of rank m>* It would be possible of course to 
eliminate m variables by use of the equations of constraint, sub- 
stitute in Q, and apply the theory of the previous section to a 
quadratic form in (n — m) variables. But this is lacking in 
symmetry 


following identities 

ax = \Ix 
f{a)x = /(A/)* 

- /(A )Ix 

Where /(a) *= a~‘, the theorem above emerges as a special case In the above theorem 
/ must be a rational function 

14 It is, of course, sufficient that a be the matrix of a positive definite quadratic 
form, since, if it is positive everywhere, it is clearly positive for the subset of values 
satisfying the m conditions of constraint But this is not a necessary condition The 
failure to recognize this was responsible for technical errors in economists' treatment of 
utility theory up until Slutsky s classic treatment of the subject 
The maximum of 

,*„) 

subject to the side condition 

E p,xi = I 

does not require as secondary conditions that the matrix of the Hessian 
Zv.il 

be a negative definite form In fact, the definiteness of this matrix is not invariant 
under a general monotonic transformation of the utility index of the form 
FM 
Fi = Fn 

Ft, = F 1 <p,j + F'v.Vn 

eas the correct conditions later to be developed are mvarnnt. 
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As in the previous section, there is no loss of generality in con- 
sidering the behavior of Q on the unit hypersphere defined by 

E hi 2 -1=0. (42) 

* # 

This additional side condition is adjoined to the m equations of 
(41), and Q is to be minimized. Form the expression 

aahihs + 2 £ E gfhj -HE h? - 1), (43) 

11 11 i 

where X and. the n’s are Lagrangean multipliers. At a minimum 
we must have 

Hi = 2 (tauhj + E F«gi° ~ =0, (* = 1, • • • , n) (44) 

1 1 

or 

2) ( a ij 4” E Fagi a — 0, {i =• 1, • • • , n) 

1 1 


E gi a hj = 0 , 


1 


(a = 1, ••■,*») (45) 


E K - 1 = 0 . 

1 


A non-vanishing solution of these equations exists if, and only if, 
the following bordered determinant vanishes, 


An(X) = 


0,7 X 0 ij gi a 

gi a 0 

= (- 1 )». 4 (X - Xl)(X - X 2 ) • • • (X - Xn— m) 


(46) 


where A > 0. The roots of this (n — m)th degree equation are all 
real because of the symmetry of the 'determinant. Let them be 
written in algebraic order 


Xi = X2 = • ' • = Xn_m. 

Then if the form is to be positive definite under the above con- 
straint, the smallest root, Xi, easily shown to be equal to the abso- 
lute minimum of Q under constraint, must be positive, and con- 
versely. Hence , it is necessary and sufficient in order for a form to 
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be positive definite that all roots of the above bordered determinant 
equation be positive w 

Recalling that the roots art all real and applying Descartes’ 
rule of signs, it can be shown as m the previous section that the 
following bordered principal minors must all be of sign (— l) m 


0 < (- 1W = (- i)"|2” 


£<r 

0 


(*, 7 = 1, , r) 

(a, / 3 = 1, ,m) (47) 

m + 1 Srsii 


These in — m) conditions are both necessary and sufficient 

If the form is to be negative definite under constraint, the de 
termmants B' must alternate in sign, the first one being of sign 
( — l) m+l , the second of opposite sign, etc , 1 e , 

(- \) T B r >0, m + 1 <rg« (48) 


The inverse of the bordered matrix 

It is of considerable importance for many economic problems 
to know what is implied with respect to the inverse or reciprocal 
of our matrix The latter can be written in matrix notation 

[nil a,„ gi l gi” ] 



[gr gn m 0 0 J 

The matrix has been partitioned into four submatrices, one is n 
by n, another m by n, still another n by m, and the last m by m 
The matrix is symmetrical, 1 e , 

R - R! (50) 


18 The form is negative definite if and only if all are negative If the roots d ffer 
sign it is indefinite If one or more are zero but the rest are of the same sign it is 
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The inverse or reciprocal matrix is defined by 
RRr 1 = I — identity matrix. 

It can be written 



g' 

~ d 

_g 

Lg' 

0. 

T 

2. 


379 


(51) 


Performing the operation of matrix multiplication, we find 


[ad + gg' 

ag + gq~ 

= 

‘J 

O' 

l g'd + 0 

g'g + o J 

.0 

J. 


= I. 


(52) 


This implies the following four matrix identities: 

ad + gg' = I, 
ag + gi = 0, 
g'd = 0, 

g'g = /• 


(53) 


It can be shown that a is the matrix of a positive semi-definite form 
provided Q is positive definite under constraint. It is only of rank 
{n — m ) ; i.e., all determinants of higher order than [n — m) vanish. 



MATHEMATICAL APPENDIX B 

THE THEORY OF DIFFERENCE AND OTHER 
FUNCTIONAL EQUATIONS 

1 In the first mathematical appendix I have included a very 
brief summary of such theorems as play an important role in eco 
nomic analysis However, it seems desirable to give a fuller treat 
ment of difference equations and other functional equations First, 
although in many ways the theory of simple difference equations 
is more elementary than that of differential equations it is not 
customarily taught m mathematics courses, and no convenient ex 
positions of the subject from the standpoint of the economist are 
to be found 1 Second just as differential equations are of cardinal 
interest to the physicist, so difference equations are of the greatest 
importance to the dynamic economist, and hence their exposition 
deserves a prominent place 

A Single Equation 

2 A difference equation can be written as an implicit relation 
of the following form 

f[y(t + n) y(, + n - I) , y(l) (] = 0 (1) 

This will be said to be of the nth order because it involves the 
variable y at n different (equally spaced*) periods of time It may 
be noted that the fact that the y values are defined for / values 
differing by an integer does not really imply any restriction for 
if the equally spaced values differ by any quantity, we may redefine 
ucn time ‘varraVre so as to ma'xe this di'fference equa'i to witty 
Although the values of t differ by an integer, it is not necessary 
that t take on only integral values However, for purposes of eco 

1 The economist may consult with profit G Boole Treatise on the Calculus of Finite 
Dtfft ences (London 1880) L M Milne Thomson The Calculus of Finite Differences 
(London 1933) P M Batchelder An Introduction to Linear Difference Equations 
(Cambridge Massachusetts 1927) N E Norland Vorlesungen uber Differen enrechnung 
(Berlin 1924) for a more advanced treatment and bibliography 
380 
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nomics it is usually only necessary to consider the values of y for 
integral values of time. By breaking up our time into small enough 
units we can keep the error involved in this procedure down to any 
requisite level. As we shall see, this greatly simplifies the mathe- 
matical theory and makes it resemble very closely the theory of 
differential equations. 

Equation (1) is an identity in t. The reader will see from this 
that it may be written in very many ways, e.g., as a single relation 
not between [ ~_y(t + »), • • •, y{t)~], but as a single relation between 
Qy(0> 1 • y(t — »)], etc. However, because equation (1) contains 
as its last factor the explicit term t, rewriting our difference equa- 
tion in the manner suggested will involve a slight modification of 
the form of the / function. Non-historical systems, which are of 
the greatest interest in dynamic economics, do not involve time 
explicitly and therefore omit the last t in equation (1). 

3. Difference equations receive their name because they involve 
the value of a variable at different points of time. As written in 
equation (1), however, there is no direct use made of the notion of 
finite differences A n y, defined in books 2 on interpolation as follows: 

Ey(t) = y {t + 1) - y(t), 

A 2 y(t) = A y{t + 1) - Ey{t) = \y(t + 2) - y{t + 1) J 

- \y{t + 1) - y{t ) } 

= y{t + 2) - 2 y(t + 1) + y(t), 

• (2) 

A n y(t) = A n ~'y(t + 1) - A n ~ l y{t) 

n 1 

- <-l)W + *-«•> 

A °y(t) = y(t). 

It will be intuitively clear that a knowledge of n adjacent values 
of y will permit us to calculate the value of y and of its first {n — 1) 
differences. (See equation 6 below.) Therefore, by substitution 
into equation (1) we can always write it in the following form: 

E[A n y(/), A n_1 y(/), • • A y(t), y{t), t ] = 0, (3) 

2 See E. T. Whittaker and G. Robinson, The Calculus of Observations (3rd ed., 
London: Blackie, 1940), chaps, i-iv. 
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where F is related to / as follows 
F£A n y, , Ay, y, 0 ~ fly 4- nAy 

+ — ” 2 ~ A *y + + A*y, , v + Ay, y, /] (4) 

Not only can every difference equation like (1) be written as (3), 
but also conversely, because of the relations given in (2), every 
equation of form (3) can be written as form (1) Thus the two 
forms are completely equivalent, and we can choose between them 
on the basis of conxemence Actually, the form of equation (1) is 
most frequently encountered in the literature, although later it wall 
be shown how either may be handled 

4 Their relationship to each other may be further revealed if 
we examine brief!} the notion of an operator Thus, A may be 
thought of as a difference operator defined by the first equation 
of (2) In a similar way we may define the differential operator 
D(= d/dl) by the relation 

Dy(t) = y'(t), D*y(t) = Dy'(t) = y"(t), etc 
In the last century the English mathematician Boole, following up 
the earlier fruitful suggestion of Leibnitz, employed the shifting 
operator E defined as follows 

Ey(l) = y(t + 1), £*>(/) - y(t + 2), etc 

The success of the operator method stems from the recognition that 
these may be treated as if they were algebraic quantities Thus, 
leaving out the y function completely, we may write out the 
operator identity 

E — l + A (5) 

Treating these as algebraical expressions subject to the usual bi 
normal theorem we dem e immediately the relations 

E- = (1 +A)» = 1 + nA + ' ,( ” ~ 1) A» + 

+ ’ (6) 
A" = (E - 1)" = E" - n E-* + + 

+ <- + 

ote the agreement of the second of these with equation (2), and 
the use of the first of these in connection with equation (4) 
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Exercises: 

1. The algebraic deviation of the price of hogs from the equilibrium value 
is in any year £ the deviation of the previous year, but of opposite sign. Set 
up a difference equation for the price of hogs. Intuitively is the process stable 
or unstable? (Hint: Use as your variable the deviation of price from the equi- 
librium value.) Write out the first few terms of the sequence. 

2. Form a difference table of the numbers 2, 4, 8, 16, 32. How many 
difference columns can you fill in with the given numbers? What is the general 
rule relating the number of columns of the difference table which can be derived 
from a given set of numbers? Note the triangular form. 

3. Write the difference equation of the first exercise in the form of equation 
(3) rather than (1). 

4. Write the difference equation y(t + 2) — a(l + (3)y(l + 1) + ctfiy(t) = 0 
in terms of the E notation and also in terms of the A notation. 

5. What is the meaning of £ -1 ? 

6. The receding difference operator V is defined by Vy(l) = y(t) — y{l — 1). 
Give its relationship to E~ l , corresponding to the relations of equation (5). 

7. Write out the usual Taylor’s expansion for y(t + 1) in terms of y(t) 
using the D notation. Verify the relation e D — E. (Hint: Use the series 
expansion e x = 1 + x/V. + x 2 /2! + • • ••) What would seem to be meant by 
sine D ? by 1/(1 — A)? 

8. What is the first difference of I 2 ? the second and third differences? What 
is the third difference of any second degree polynomial? What is the nth 
difference of any nth degree polynomial? the (n -f- l)th difference of an nth 
degree polynomial? 

9. Write out the difference equation satisfied by the numbers in exercise 2. 

10. Let us define ascending factorial polynomials as follows: x M = x(x — 1) 
- • • (x — n + 1). Verify that Ax (n) = nx ( "~ l) . Of what does this remind you? 

Systems of Equations 

5. Up until now we have been concerned only with a single 
variable. In most applications to an economic system many vari- 
ables are involved. Hence, we must introduce the notion of a 
system of difference equations in m variables (yu • • • , y m ). These 
consist of a set of difference equations in the variables in question. 
Just as a static system can only hope to be determinate if it consists 
of as many equations as unknowns, a dynamic system must contain 
as many independent, consistent equations as there are unknowns. 
Thus, our system takes the form 

fly i(t + »), y s (l + n), • • • , y m (t + »);•••; 

yi{t), yi(t), • - • , y m {t ) ; t] = 0 , 


j m lyi{t + n ), yo(t + n), • • • , y m (t + n); •••; 

yi(t), ■ ■ ■, y m (t ) ; t] 


( 7 ) 
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As before if desired this can be written in finite difference form by 
making use of the relationships between A and £ It is not neces 
sary that each of the equations in the above set should involve the 
same number of lags it is often convenient as a matter of notation 
to write each as if it contained the same number of lags as the 
greatest number n which occurs in any of the equations From 
the discussion at the end of section 2 of this appendix it will be 
dear that we can shift our time reference point so that some 
y (/ + ») appears in each equation 

6 A difference equation system such as is defined in equation 
(7) can be thought of as an identity which holds for each value of t 
It is equivalent then to an infinite number of implicit relations 
This does not lead to overdetcrminatencss because each new time 
period defines as many new variables as there are equations In 
fact if we confine / to a finite number of consecutive values we 
shall always find that ue have less equations than we have un 
knowns Thus the sj stem is formally indeterminate unless we ad 
join certain so called initial conditions or boundary conditions 

This may be illustrated by a sum of money grow mg at six per 
cent interest per annum Its behavior over time follows a simple 
difference equation but the process is completely unambiguous 
only if we specify how much money there was to begin with or 
alternatively if we specify to what value the principal will have 
grown by any given date 

In the above example of a first order difference equation involv 
mg a single variable only one initial condition cm be arbitrarily 
specified Therefore one is said to be the order of the system 
If we had a system consisting of two difference equations each of 
the third order (1 e each involves three lags) then the number of 
initial values which can be arbitrarily prescribed would be seen to 
be tw o times three In general where there are m difference equa 
tions in m variables and each mv olves n lags the number of arbi 
tranly assignable initial conditions will be seen to be m times n 
and this number will be termed the maximum order of the system * 

7 Adding initial conditions equal to the order of the system in 
effect supplies us with the missing equations discussed in the pre 

’In the ntercs *> of expos t n il c nve ence a d u fo n ty I have departed from 
ll e u ual muthemat cal custo n f say ng that n th a case the order of the system s at 
most n t mes m By adopt ng the convent on descr bed at the end of sect on 5 I have 
nsured that the order w II be exactly n t mes m 
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vious section so that our unknowns no longer exceed the number of 
our relations. Actually, it is well known that a system of implicit 
equations equal to a specified number of variables may not de- 
termine a unique solution because of inconsistency, dependence, 
multiplicity of roots, etc. However, in any relevant, empirical 
economic system it is the task of the economist and not of the 
mathematician to specify an adequate number of relations to make 
the system determinate. Moreover, in causal dynamic systems it 
is natural to think of causation as taking place unilaterally in time, 
from past to present rather than from future to present. There- 
fore, it is convenient to concentrate upon the case where the speci- 
fied boundary conditions are truly initial conditions sufficient to 
start the system off, after which it determines its own behavior at 
each step. This means that the implicit equations of (7) can be 
solved explicitly for values of yft + n) as single-valued functions 
of “previous” y’s. 

y<(t + n) = -fin- 1), • • • , y m {t + n - 1) ; • • • ; 

yi (0. • • • - y»(0 ; Q- (* = !>•••, *») (8) 


Under these conditions, if we specify any initial conditions of 
the form 

y,n(0) 


yi(0) ys(0) 

Lyi(» — 1) ys(n - 1) 


y m (n - 1) J 


(9) 


then we can clearly determine from equations (8) the quantities 
[yi(n), ■ ■ ■ , y m (n)~]. Adding these to the previously prescribed 
variables, we are in a position to determine the y’s at the next 
instant of time, and so forth ad infinitum. In other words, by 
the use of the familiar principle of mathematical induction we can 
state an important existence and uniqueness theorem: a system of 
differ ence equations of the type indicated in ( 8 ), subject to prescribed 
initial conditions as given in (P), possesses a solution for all positive 
values of t. The solution is unique. 

This theorem is valuable, not because it tells us how to find a 
convenient solution, but because it assures us that once we have 
found a solution, and verified it by substitution in the difference 
equation and initial conditions, then we can rest assured that it is 
the only possible solution. 
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8. In this section it will be shown that any system of order S, 
whatever be the values of m and n, can be replaced by a system of 
5 first order difference equations in 5 variables. This reduction to 
what may be called normal form is always possible by means of 
redefining S = nm new variables, Yn, • « *, Y nm , as follows: 

Y u = y. 

Yu = Ey t 

(*-l. *••.«) (10) 

Y M = E'-'y, 

In terms of the new variables the reader can show that the system 
defined by (8) becomes 

EY U - Yu 
BYu = Yu 

«-l, ••*.«) (ID 


E7- ltl = Y ni 
EY ni - J\ 


Our initial conditions are written simply as 

[r„(0)] = - l)]. (12) 

By way of illustration let us consider a system consisting of 
one second order equation in one variable. 


y(t + 2) + a.y(t + 1) + a*y(#) = sin y(0) = bo, y(I) b.. (13) 

The order of this system is clearly two. Defining new variables 
as follows 

Y i (t) = y(t), 

Y t (t) = Ey{t), (14) 

we can write the above system in the normal form 


EYt(i) = — c l y J {/) — a,y,(/) + sin /, 
EY.it) = 7,(0, 

7.(0) - b 0l 
Yz{ 0) = b t . 


(IS) 


The reader should work out several examples, such as that of 
exercise 4, section 4 above. 
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It is to be noted that equations (7) can by a similar definition 
of variables be written as 5 first order implicit equations. Indeed, 
an indeterminate system, involving more unknowns than equa- 
tions, can always be converted into an (equally indeterminate) 
first order system. 

9. We have seen that a single high order equation can be trans- 
formed into a number of first order equations. Is the reverse 
possible? Can a first order system (and, therefore, any system) 
be transformed into a single equation, whose maximum order is 
equal to the maximum order of the normal system? Usually the 
answer is yes, although the reduction is easier to indicate than 
actually to perform, and in singular cases, where certain conditions 
on the partial derivatives of the /’ s fail, the reduction may be im- 
possible. Also, a well-determined explicit system may after reduc- 
tion be expressible only as an ambiguous implicit high order relation 
in one variable. Thus, the well-determined system 

Ey 2 = yi 2 ; Ey r = y 2 + yi (16) 

can be reduced to the well-determined second order equation in y x 

E 2 y x — Eyi = y x 2 , (17) 

but in yz we get only the ambiguous implicit equation 

(E 2 y 2 - Ey 2 - y 2 2 ) 2 = 4 y?Ey 2 . (18) 

Let us suppose that we have a normal system of order S, 
y>(t + 1) — *AQyi(f), • • *, y»(t), t}, (i = 1, • • •, s) (19) 

or, in abbreviated notation, 

EY = J(Y,t). 

It will necessarily involve 2S variables, namely the y’s at two in- 
stants of time. Given S equations in 2 S variables, it is not possible 
to eliminate all but one of the y’s and be left with a difference 
equation involving only that y. If we write down the identities 

E 2 Y = J{EY, t + 1) 

E*Y = J(E*Y,t + 2) 


E'Y = J(E*~ 1 Y, t + S- 1), 
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we shall have introduced as many new variables as we have equa- 
tions. But not all of these need be eliminated; only those involv- 
ing all variables except the one in question All together we 
have S 2 equations in S 2 -fr- 3" variables, and we seek to eliminate 
(S — 1)(5 -f 1) variables. Our number of equations exceeds this 
by one, so we can hope to perform the elimination, the remaining 
equation being the desired S order equation in a single variable. 4 

10. Everything said about difference equations applies with 
very little modification to differential equations The reader 
should write out a single differential equation corresponding to the 
single difference equation (1) by writing out the latter using the 
operator E, and then substituting for E the operator D. Simi- 
larly, he should write out the system of equations (7) in terms of E, 
and then derive a system of differential equations by the substitu- 
tion of D for E. The same applies to initial conditions and to the 
notion of the order of a system The argument showing how a 
system of equations can always be reduced to first order normal 
form also applies directly to differential equations. He may check 
his reasoning by referring to the first chapter of the Moulton book 
just cited. 

Exercises 

1 The national income of each of two countries equals domestic investment 
plus domestic consumers’ goods production plus exports The consumpuon 
and imports of each country are respectively determinate functions of income 
in that country in the previous period Set up the system of difference equa- 
tions describing the behavior of each national income over time Specialize 
these by assuming constant marginal propensities to consume and import 

2 Use the 45° line graphical method of chapter x to analyze whether or 
not the equation, F t+l * + F»* =■ 9, defines a unique causal system If neces 
sary, modify the definition of the equation slightly so as to remove all am- 
biguity Can the initial value, F 0 , be assigned completely at pleasure ? (Hint 
try a few arithmetical examples ) 

3 The system, Ey{l) = z(t), Ez(t) = 5 y(t) -f- 2 s(t), was converted to 

normal form from a single equation in y What was the exact form of that 
eryiatinm, and what d^nitnm were, used w. >.t t-a o/armaJ. (atm? 

4 Given the system, Ey i = yt, Eyt — yu derive a second order equation 
in each of the variables alone 

'The reader may consult F R Moulton, Differential Equations (New York, 1930), 
PP 6-9, for a more complete discussion of this problem taken with special reference to 
differential equations For linear equations this is worked out in some detail In P A 
Samuelson, "A Method of Determining Explicitly the Coefficients of the Characteristic 
Equation,” The Annals of Mathematical Statistics, XIII (1942), 424-429 
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5. A body falling in the earth’s gravitational field satisfies the equation, 
d 2 x/dt 2 = — g. Convert into normal form. 

6. Write out a system of difference equations in A form. Define the con- 
cepts of order, normal form, etc. 

11. If only numerical answers are desired, one may simply use 
the difference equation at each instant of time to calculate suc- 
cessively the new values of the variables, and so forth for as long a 
time interval as is desired. However, this may be tedious in prac- 
tice, and inevitably there must be rounding off at each stage so 
that the accuracy of the solution deteriorates with its length. (For 
a stable system if the errors of rounding off remain within a certain 
limit, the errors of the solution will remain bounded.) 

The method of actual numerical examples has had considerable 
use in economics, and is valuable from the standpoint of providing 
illustrations. 5 Still one can never be sure that another arithmetical 
example might not contradict the principles inferred from this one, 
so that the method is of limited usefulness even for the purpose of 
providing qualitative information concerning solutions. After all, 
life is not long enough for us to complete even one sequence, nor 
to try all possible initial conditions, nor to vary the form of the 
structural relations. 

Unfortunately, if we simply consider difference equations in all 
generality, there is no simple analytical way to write out explicit 
solutions. The same is true of differential equations. The be- 
ginning student may be taught a variety of ad hoc tricks which may 
work on particular equations, but the advanced student must rest 
content with his existence theorem which tells him that there exists 
a unique solution; often the proof of the existence theorem tells 
him how he may (painfully) approximate to that solution. 

In both of these lines much more complete results may be 
achieved if we restrict ourselves to linear systems. That this may 
be a serious limitation, especially for business cycle purposes, is 
shown in chapter xi. However, in chapter x it is shown how the 
assumption of linearity is not seriously limiting if we are primarily 
interested in stability, especially in the small. 

In much of what follows, therefore, we shall confine our atten- 
tion to systems which have two properties. First, they are to be 
linear. There is the added advantage that most mathematical 

6 E. Lundberg, Studies in the Theory of Economic Expansion (London, 1937). 
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expositions arc concerned with tl esc Second they are not to in 
volve time explicitly Most economic systems of interest are 
causal rather than historical in nature and satisfy this require 
ment thereby simplifying our task and enhancing mathematical 
tractibility 

In fact if we insist that a system be timeless and at the same 
time linear we are left with the exceedingly simple case of constant 
coefficients whose theory is perfectly complete and elementary 
We shall see that exactly the same thing is true of differential equa 
tions so that one who has mastered cither of these branches has 
little difficulty with the other From a more fundamental point of 
view each of these is simply the exemplification of a more general 
theory of linear operational equations 

12 There is however one non linear case w here a more or less 
complete solution can ahvaj s be indicated That is the case of the 
first order equation of the form 

y+i-flyd (2D 

The 45° diagram of chapter x pictures the various types of cqui 
librium solutions y, = a with the property a — f(a) =0 It is 
also clear from that diagram that a motion which remains bounded 
need not approach an equilibrium position but may approach a 
periodic motion (one of the boxes of the cobweb theorem) Some 
of these boxes are stable and some are unstable We may test the 
stability of a motion which repeats itself ex cry 5 periods by the 
method which Professor Leonticf G has suggested Let us define 

/.(*) =/(*) 

Mr) = JWx)l 

( 22 ) 


Mr) = Jlf.-i(x)l = /[/!/.-*(*) I ] etc 


It is then true that the solution of equation (21) at any time t 
for given initial condition y 0 is given by 


y(t) = Myo) 


(23) 


* W Leont ef Verzogerte Angebotsanpassung und Part elles Gte chgew cbt 
Ischnftfur Naltonalokonontte Band V (1934) 
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However, this is more an indication of a solution than a solution 
itself since the f’s in question may be very complicated functions. 

Suppose that (a 1} a 2, • • ■ , a s ) represents a periodic solution of 
the original equation. This will be true if, and only if, the new 
difference equation 

yt + 1 = Myt) (24) 

is such that 

fs(o-i) = ( i = 1, • • • , s) (25) 

The motion will be stable if, and only if, the above equilibrium 
points of the new single difference equations are stable. This can 
be tested by means of the methods developed in chapter x. 

Exercise : 

Show that the stability of the periodic motion depends upon 

l/>,)/'(a 2 ) •••/'(a s )i§ 1. 

[Hint: What is / a '(a;) equal to?] 

13. The case of the first order differential equation which does 
not involve time explicitly, 

dy/dt = f(y), (26) 

is one which admits of an immediate solution by quadrature, as 
indicated in chapter x. Its richness of solution is limited, every 
motion being monotonic. Hence, if bounded, the motion must 
approach an equilibrium point. There may, of course, be a number 
of equilibrium points alternately stable and unstable. 

Exercises: 

1 . Let f(x) = x\ What is the genera! expression for /*(*)? 

2. Show that every equilibrium value of fix) is an equilibrium value of 
fz( x )- Is the converse true? What about the stability of the common equi- 
librium points, will they agree? 

Linear Systems 

14. A linear system is one which involves the dependent vari- 
ables in expressions of the first degree, i.e., without higher powers, 
cross-products, nonelementary functions, etc. For the purpose of 
this section it is not necessary that time be excluded or that it enter 
the problem linearly. We may write our linear system in the form 

L[ym = m, (27) 
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where L stands for any general operation upon a whole system of 
variables and / represents a known function or set of functions 
For the discussion of this section it is not even necessary that L 
represent difference equations rather than differential equations 
Rather it may represent an implicit functional of y over the whole 
t interval and in the notation of chapter ix page 261 it could be 
written as 

£[>(<)! 1 = / T>W 0 

The fundamental linear property which L must satisfy is as 
follows 

£(yi + y>) = £(yi) + Uyi) ( 28 ) 

from which follows immediately the general relation 

L{aiy % + a 2 y 2 + + o»y») 

= Oi£(yi) + a 2 L(y 2 ) + + a n L(y n ) (29) 

From this general definition of a linear system there follow lm 
mediately two important general theorems 

Superposition theorem The solution of (27) when f is made up 
of the sum of two different fs is equal to the sum of the solutions ap 
propnate to each Mathematically this may be written in slightly 
more general form 

L(y>) = U implies LfEa.j) “ (-30) 

The proof follows immediately from (29) 

This means that the solution of a system with many forces 
acting on it is equal to the sum of the separate effects of each 
Even where this is not strictly true the economic situation may 
approximate to this condition 

For many linear systems it may be possible to give the solution 
for a fundamental set of / s into which any f can be decomposed 
If this is the case our problem is immediately solvable 

Our second theorem states The difference between any two solu 
tions of (27) must satisfy the reduced (or homogeneous) equation 

L(y) *= 0 (31) 

This follows immediately if we substitute each of the solutions m 
77 ) and then subtract one expression from the other The right 
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hand function will cancel out, and according to (29) we may regroup 
the terms on the left-hand side to achieve the desired result. 

A most important corollary of this theorem is derived simply 
by rearranging terms. The most general solution (i.e., any solution) 
of (27) may be written as the sum 0 f some one particular solution and 
the general solution of the reduced equation (37). 

This permits us to concentrate upon the more easily manageable 
reduced equation in order to determine the uniqueness, multi- 
plicity, boundary conditions, etc., of the solutions. It is often 
possible by ingenuity, good luck, or intuition to arrive at a par- 
ticular solution of the unreduced case, and the present corollary 
tells us how to transform it into the general solution, if only we can 
find the general solution of the reduced case. 

Exercises: 

1. Except that there are no boundary conditions, show that the above 
remarks apply to a single linear algebraic equation of the form, ax = b. Show 
that when the reduced equation, ax = 0, has but a single solution, the full 
equation has but a single solution. Can the converse be stated? Can you 
state that the uniqueness of either equation guarantees the uniqueness of the 
other? (Hint: Distinguish three possible cases, neither a nor b equal to zero, 
both equal to zero, only one equal to zero.) 

2. Consider two (or more) linear algebraic equations. Show that unique- 
ness of either the reduced or unreduced equations implies uniqueness of the 
other. Show, however, that the reduced equation can have many solutions, 
when the unreduced has none. Enumerate all the possibilities, and with the 
aid of a textbook on algebra show how one can recognize each case. (Hint: 
The terminology of matrices and determinants will be found useful.) 

3. Show that the following integral, and mixed difference-differential 
equations are respectively linear: 

y(t) + W - a)y{a)da = /(f), 

y'tf) + ay(t - 1) = /(?)• 

Do they involve time explici tly? 

15. In view of the importance of the reduced or homogeneous 
system of the form 

L(y) = 0, (32) 

we shall devote a number of sections to it. Most of our analysis 
will apply to systems of equations, but occasionally it will be de- 
sirable to confine attention to a single high order equation in one 
variable. Where difference equations are concerned, this may 
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written m the form 

a,(l)y(l + n) + a,(t)y(t + n - 1) + + a.(Qy(t) = 0 (33) 

Much of the mathematical theory of difference equations is 
concerned with the case where the a s are simple polynomials or 
analytic functions just as much of the classical theory of ordinary 
differential equations is concerned with the case 

flo(/)y-(<) + «i(/)r '(/) + + a n (t)y(t) « 0 (34) 

where the a s are of the same simple type 

However if we insist that the economic system be nonhistoncal 
l e independent of calendar time and if we also insist that it be 
linear then the a s must be constants and we have the case of 
constant coefficients 

16 Whether or not time enters explicitly it is easy to derive 
from equation (29) the theorem A linear combination of a number 
of solutions of the reduced equation (32) is itself a solution i e 

I(ciyi + c 2 yz + + c y t ) = 0 if L(y ) = 0 (35) 

Since the difference between two solutions is a linear combina 
tion we have the immediate corollary 

L( 0) = 0 (36) 

If the y stands for a whole set of variables the zero must of course 
stand for a whole set of variables equal to zero 

Usually m the analysis of differential equations and difference 
equations as well the mathematician looks for a basic set of special 
solutions in terms of which any solution can be written by linear 
combination This linear combination written with undetermined 
coefficients is called the general solution of the reduced system 

If there is an existence theorem which guarantees a unique solu 
tion of the reduced system satisfying certain initial conditions we 
should of course expect the number of undetermined constants m 
the general solution to be equal to the number of arbitrarily as 
signable initial or boundary conditions or equal to the order of 
the system 5 

The reader should note that not every set of S distinct special 
solutions of the reduced system will form the basts for a general 
solution Thus if one or more of the given 5 solutions can be 
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written identically in t as a linear combination of the others, the 
set of particular solutions is said to be linearly dependent and cannot 
form a basis for the general solution. For a linear combination of 
all S solutions can in this case be regarded as a linear combination 
of less than 5 linearly independent solutions; the number of con- 
stants in such an expression is necessarily less than S, and hence 
we cannot hope to satisfy S arbitrary initial conditions. 

17. However, the reverse is not necessarily true. If we find 5 
solutions which are linearly independent (i.e., which are not linearly 
dependent), it does not follow from our existence theorem that they 
form a basis for the general solution. Indeed, there may exist no 
general solution in the sense in which we have been using the word. 
This is perhaps inadequately emphasized in the mathematical texts 
because of their preoccupation with the special case of differential 
equations where a stronger theorem can be enunciated. Thus, 

D 2 y(t ) = 0 (37) 


has linearly independent solutions (1,1) which form a basis for a 
general solution, in terms of which any solution can be written. 
But 

E 2 y(l) = 0 

has two linearly independent solutions, 

yi(t) = 1, t = 0; y 2 (t) = 1, 

yi(t) =0, 12 0; yi(t) = 0, 


t = 1 , 

t 1 , 


(38) 


(39) 


which do not provide such a basis. For the function 

=1, t= 2, 

= 0, t is 2, 

is a solution of (38) for appropriate initial conditions, and yet it 
cannot be written as a linear combination of the two linearly 
independent solutions. 7 

7 It may be objected that yi(l ) does not satisfy the difference equation (33) for all 
values of t, particularly for l equal to zero. However, a solution is to be thought of as 
satisfying a difference equation if for given arbitrary initial conditions it is in accord with 
the prescribed law. To help clear up the present paradox, it should be pointed out that 
solutions which are linearly independent on one interval of t may be linearly dependent 
on another interval. It is also to be pointed out that the example cited is one in which 
the solution of the difference equation is unique to the right of the initial conditions but 
not to the left. It is also one in which the irreversibility in time disappears if the differ- 
ence equation is replaced by an '‘equivalent" one of lower order. 
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The absence of a general solution docs not me m that we cannot 
find the solution for given initial conditions Our existence 
theorem guarantees us tint we can find such a solution Actually 
such a solution can be written as a linear combination of basic 
solutions However the above example shows us that the linearly 
independent basic solutions may have to be different sets for every 
initial point there need not exist a set of basic solutions which will 
serve uniformly for all initial times 


Linear Operator Equations with Constant 
C orFFICIFNTS 


18 I turn now to the case of difference or differential equations 
with constant coefficients Since it will be shown later that the 
solution for equations of any order can be written in terms of 
solutions of the first order case I shall consider this simplest of all 
cases In the first order case our difference equation becomes 
(£ - <z)y«) = /(/) 

y(l.) = b (41) 


and the corresponding differential equation becomes 
(D - g)y(l) = f(t) 

y </.) - b 


(42) 


Each of these may be considered as special exemplifications of 
a general first order equation in a simple operator h 


(h-q)y(t) »/(/) 

y(/o) = b 


(43) 


For h to be an admissible operator it is only necessary that there 
be an existence theorem guaranteeing that the above equation has 
a unique solution for the arbitrarily specified initial condition 
How this existence theorem is established need not concern us at 
this point we hav e seen how it can be done for difference equations 
and the reader may refer to any work on differential equations for 
the corresponding proof In addition to E and D examples of h 
are A tD E + 5 etc On the other hand such an operator as D 5 
would not be an admissible h because the system corresponding 
to (43) would not have a unique solution This is because two 
initial conditions are necessary rather than one to satisfy a second 
order differential equation 
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We may suppose the solution of the above system to be 
written as a linear functional of/, with q and b as parameters, or as 
F(t, q, b; f)h, where F has the fundamental linear property 

F(t, q, c x bi + • • • + cj) s ; Cifi + • - - + cj s ) h 

= d F(J, q, b u -fi)h + • • • + c,F(t, q, (44) 

From this fact it is easy to verify that h is a linear operator 
with the properties 

h(c\-y x -f- c 2 y 2 ) = cjiyi -f- c 2 hy 2 , 
li 2 y — h{hy), 


(45) 

h"y = hh’ l ~ l y , 
h°y — ly, 
hO = 0, 

P{h)Q{h)y = Q{h)P(h)y, 

where P and Q are polynomials. 

19. So far we have not defined h raised to a negative power. 
It would be natural to define 


y = h-yit) 

as the solution of 

hy = f{t). 

However, it is clear from our existence theorem that this cannot 
have an unambiguous meaning unless we specify some initial con- 
dition. This corresponds to the familiar indeterminacy of the in- 
definite integral of a function, D~ l f. We need a constant of inte- 
gration or a definite lower limit of integration before there can be 
a determinate answer. We shall not have occasion to use negative 
powers of h even though readers familiar with the Heaviside opera- 
tional methods will realize the mnemonic advantages to be derived 
from such a procedure. These advantages are counterbalanced by 
the aura of mystery which surrounds some of the usual expositions 
of this method. In general, the methods here outlined will be of 
“classical type,” albeit in somewhat amplified form. It is the 
present writer’s conviction, which need not be sustained here, that 
a careful examination of the Heaviside-Cauchy operational calcu- 
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lus, of Bromwich Wagner contour integrals, of Laplace transforms, 
etc , will show that where these differ from the classical methods 
the advantages are in favor of the latter These remarks are in 
tended to apply to ordinary differential equations of finite order 
with constant coefficients and do not refer to the use of the Laplace 
transform in connection with partial differential equations, nor in 
connection with integral equations of faltung type, etc 8 

20 Being first concerned with the reduced equation, where /(/) 
vanishes we concentrate upon solutions of the form F(t, q b, 0)* 
Because of the linear properties of h given in (45), this can be 
written in the special form bF{l, q 1 , 0)* Hence, peculiar impor 
tance attaches to the solution of the reduced first order equation 
with initial condition equal to unity Let us determine, there 
fore, the c\act form of the F function for h equal to E and for h 
equal to D 
Clearly if 

Ey = qy, y(/„) « 1, 

then 

y(/o + 1) * qy(U) = q t 


(46) 


y{to + t) *= qy{lo + 1 - 1) = qqy{io 4 - 1 - 2) = = g‘, 

y(t) « 
or 

F(t, g, l,0)s = q‘~‘» (t g k) 

Of course, the final proof of the above relation is to be found 
in the fact that when substituted into the difference equation, 
satisfies it and the boundary condition in. question Our 
existence theorem guarantees that it is the only solution 
We may rewrite the differential equation (42) in the form 

dy/y = qdt (4?) 

Integrating each side, taking antilogs and inserting the appropriate 
constant of integration so as to satisfy the initial condition of unity 
at t equal to f 0 , we easily find the solution to be 

F(t, q, 1 , 0)o = *•> (t g t 0 ) (48) 

•G Doetch Theone und Anwendung der Laplace Transformation (Berlin 1937) 
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The reader may verify that the operators A and tD have re- 
spectively the fundamental solutions 

Fit, q, 1 ; 0) 4 = (1 + 2 )'-'°, (/ > l 0 ) ( 49 ) 

and 

F{t, q, 1; 0)/x> = {t/UY. it ^ to) (50) 

Thus far, it has been implicitly assumed that q is a real vari- 
able. However, no detail of the procedure or proof would be es- 
sentially changed if q were assumed to be a complex number of the 
form ia + ib). It may be argued that no real economic process 
need be described in terms of complex numbers. This can be 
readily admitted. However, when we come to equations of higher 
order, it will be shown that the solution can always be written in 
terms of purely algebraic combinations of the roots of higher order 
polynomials. In order that every polynomial have exactly as 
many roots as its degree, it is necessary to introduce complex 
numbers. Because these occur in conjugate pairs, it will always 
be possible to regroup terms at the end and express everything in 
terms of real numbers. But it is most convenient to work with 
complex numbers in the intermediate steps. 

21. We may now inquire into the stability of this fundamental 
first order solution. It is clear that the initial condition enters only 
as a scale factor and cannot affect the problem of stability. Under 
what conditions will 

lim F(t,q, 1; 0)* = 0? (51) 

/—♦co 

If we examine each of the four specified F’s corresponding to the 
four distinct operators, E, D, A, ID, it is clear that we have the 
following respective conditions: 

E: absolute value of root must be less than one, 
or | q j < 1 ; 

D: real part of q must be negative, or Riq) < 0; 

A: absolute value of (1 + q) must be less than one, 
or |1 -f- q\ < 1; 

tD : real part of q must be negative, or Riq) < 0. 

In the complex plane the first of these is interpreted to mean 
that q must be inside the unit circle so that its absolute value or 
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modulus will be less than unity For the differential system the 
value of g must be in the left half of the complex plane Later it 
will be shown that the stability of the reduced equation determines 
the stability of the system 

22 Having introduced a basic solution of the reduced equation, 
Fit, q l , 0)», I now introduce a basic special solution of the un 
reduced equation By combining these two, the solution to the 
general first order system (43) can be written Our special solution 
of the unreduced equation is that for which the initial condition, 
b, is zero In accordance with our above notation this can be 
written as F{t q, 0 J)\ How it is to be found for any given oper 
ator need not concern us here, our existence theorem assures us 
that it exists, and often suggests how it is to be found 

The reader can verify that the following do represent basic 
special solutions for the respective simple operators 




(52) 


etc 


Because of our existence theorem, no proof other than verification 
is necessary 

Consider now the expression 

y(t) = Fit, 5, 0,/) A + bF{l, q, 1 , 0)* (53) 

Substitution shows that it is a solution of the general equation (43) 
Its initial value is 

y(to) = F(to, q, 0,f) k + bF(t 0 , q 1 , 0 )a = 0 + b, (54) 

satisfying the correct initial condition, and thereby constituting 
the unique solution 

23 We shall show that the most general system 0 / any order 
has a solution consisting of two parts a special particular solution 
of the reduced equation with vanishing initial conditions and a 
“general” solution of the reduced equation with assignable con- 
stants sufficient to satisfy the prescribed initial conditions 
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In fact, once the first order case has been thoroughly analyzed, 
the general case yields to a purely algebraic treatment. In what 
follows, I shall try to bring to a completion the Heaviside program 
of “algebrizing” the treatment of simple operators. As far as I 
know, the mathematical literature, because of its principal pre- 
occupation with special operators, stops short of such a complete 
program; at some stage the concept of integration or summation is 
introduced into the discussion. Moreover, the present fashion of 
thought introduces transcendental notions such as the Laplace 
transform even in the simplest cases of ordinary differential equa- 
tions with constant coefficients. It will appear from what follows 
that the Laplace transform, however useful it may be to one who 
has grown accustomed to it, is neither necessary for rigor nor 
necessary for analytical and computational short cuts. In a 
fundamental sense it is an artifice which “cancels out of the prob- 
lem,” and aesthetically it represents an alien intrusion having no 
“natural” connection with the problem under discussion. Further- 
more, the present treatment may throw light on the connection 
between so-called “classical” and “operational” methods, showing 
their complete equivalence and identity. Aside from the lament- 
able aura of mystery surrounding many discussions of operational 
methods, the literature abounds with diverse, contradictory claims 
and counterclaims as to the advantages of the alternative methods. 

Despite the facts that simple operators are being discussed and 
that an algebraic treatment is being used, the present treatment is 
essentially classical, with one important addition: full use is made 
of the specialized algebraic character of the equations determining 
solution constants. When so amplified, the classical methods not 
only arrive at the same results as the operational methods, but 
actually go through the same intermediate steps. Wherever the 
two methods diverge, the advantage lies with the classical. For 
those who must have their Laplace transform, the concept is suit- 
ably generalized to any operator in section 29. 

24. We may begin by enunciating what is a theorem and not 
an assumption : If h is a simple operator such that the system 

(h - q)y = fit), it to) 

yito) =b ( } 

has a unique solution for suitably Untried functions f, and for q an 
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arbitrary complex number, then the general nth order equation 
P (h)y = h n y + aih n ~ l y + a 2 h'- 2 y + + a n ~ x hy + a n y = /(/), 

O'Cfo) = &o, 

faiOlt-h = b u 

( 56 ) 


- b n -u 3 

has a solution, and the solution is unique 

To prove the first part, I shall specify one solution, leaving its 
uniqueness to later discussion First, let us note the algebraic fact 
that every nth degree polynomial has exactly n roots if we count 
each with its appropriate multiplicity If q, denotes the ith root, 
ive may w rite our polynomial in the form 

P - (*) - (* - «l)(* - i>) (* - in) (5?) 

Let us define n new polynomials as follows 

P„-,(x) = P,(i)/(i - Si) = £ 


Pn-i(x) = P«-i(*)/(* - Jj) 


Ei 

0 


PrW = Prx l(x)/(* - S„„) = Za.ar 1 , 
0 


( 58 ) 


Fi(oc) * P f (x)/(* — g„-i) = £o, I X I “‘ = x ~ q n , 

0 

P,(w) = P, (*)/(* -?,)■=! 

' For brevity we shall write _ , as h l y(l») where there is no possibility of 
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Also, let us define some new functions as follows: 

Zo(t) = /(/), 

(h - g l )Z 1 (t) = Z 0 (t), 

(h — q<2)Zi{t) = Zi(t), 


(h - q n )Z n (t ) = Z„- X {t). 

By our hypothesis on a first order system, each of these is deter- 
minable once we specify initial values [Zi(/ 0 ), Z 2 (/ 0 ), • ■ •> Z n (/ 0 )]. 
In a moment these will be prescribed. 

By virtue of the definitions of Z k {t) in (59), it will be clear that 

KD = Z 0 (t) 

= ( h — qi)Zi(t) 

- (h ~ qi)(h-q 2 )Z 2 (t) (60) 


= (A - ffi) • • ■ (h - q_n)Z n (L). 

Thus, Z„(f) is a solution of our equation, and it remains only to 
make certain that we pick the Zi(/ 0 ) properly so as to yield the right 
initial conditions. 

It is easily verified that 

Zn-iit) = Pj{h)Z n {t). (61) 

Evaluating this expression at the initial time we have 

2 CLiJl^Znito) = £ dijbj-i = Z n -j(to). 

0 0 

(j = 0,1,2, •••,»- 1) (62) 

These are sufficient equations to determine the right-hand terms 
for a given set of b’s. 

To recapitulate our construction : find the roots of the poly- 
nomial P; by synthetic division or otherwise define the n new 
polynomials, P,-; evaluate Zj(t 0 ) in terms of the b’ s; by repeated use 
of the methods of the last section, equation (54), solve in turn for 
Zi(t), Z 2 (/), • • •, Z„(/), using the initial conditions just determined; 
the resulting Z n {t ) is the required solution. This proves the first 
part of our theorem. 
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To prove uniqueness we shall assume that there are two dis 
tinct solutions and show that this leads to a contradiction Clearly 
the difference between any two solutions of (56) will satisfy the 
reduced equation with initial conditions all equal to zero Wc 
have only to show that the reduced equation with zero initial con 
ditions cannot have a solution which is not identically zero For 
the first order case our existence theorem insures that there can 
be no other solution than zero We shall proceed by induction to 
show that if a reduced equation of order (n — 1) has a unique zero 
solution for zero initial conditions then so must an nth order 
equation 

Let us suppose that the (n — l)th order equation .has a unique 
vanishing solution and the nth order has a non vanishing solution 
7(0 Then 

{P(A)7| = (A - q0[ P« i(A)7| = (A — q K )Z « 0 (63) 

where 

A 7(< 0 ) =0 (* = 1 n - 1) 

7(0 ^ 0 

and Z is defined by this equation Our hypothesis on the (« — l)th 
order equation s uniqueness tells us that Z must be nonvamshing 
since otherwise our nonvamshmg 7 would satisfy the (« — l)th 
order equation On the other hand the zero values of the n initial 
conditions of 7 make Z(/ 0 ) be zero With an initial value of zero 
and satisfying a first order equation Z must vanish identically 
Thus we have a contradiction and our conclusion concerning the 
uniqueness of our solution is proved 

It is to be noted that the case of repeated roots in the character 
istic polynomial P introduces no difficulties in the problem Also 
it is to be observed that the method of finding a solution outlined 
above is not necessarily a good one to follow m practical numerical 
computation However since it depends only upon repeated solu 
tion of first order equations it can be readily mechanized or reduced 
to a routine if some suitable method of solving first order equations 
is at hand (e g differential analyzer punch card equipment etc.) 
Exercises 

1 Write out the solution for a reduced second order d ffercntial equal on 
with constant coefficents wh ch has y(0) = 1 y (0) - 0 vh ch has y(0) * 0 

y (0) = ! 
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2 . Write out the solution of the unreduced equation with vanishing initial 
conditions. 

3. Repeat (1) and (2) for any operator h, using the F function. 

25. Having established the existence and uniqueness of the 
solution of the general operator equation of the «th order with 
prescribed initial conditions, we may explore better ways of ar- 
riving at its solution. Our work may be divided into two parts: 
finding a particular solution with vanishing initial conditions, and 
finding a solution of the reduced equation which satisfies the pre- 
scribed initial conditions. Clearly, there are other ways of splitting 
up the problem according to the properties the particular solution 
is to have. Thus, in many physics problems the right-hand func- 
tion, f(t), represents a periodic driving force, and the particular 
solution sought is the so-called steady state response of the system, 
while the reduced equation solution yields the '‘transient,” which 
dies out in a damped system. 

The reduced equation involves no arbitrary functions and is 
simpler. We discuss it first. Its solution for the most general 
case can be built up out of first order solutions, F(t, q, 1 ; 0)*. For 
brevity we shall write the latter as F(t, q). Some of its properties 
may now be developed. 

First, since 

F(to, 5) = 1, (64) 

|p F(h, q) = 0. (65) 

Also, by its definition as a solution of the reduced first order 
equation, 

hF = qF, 
h 2 F = q 2 F , 

( 66 ) 


h n F = q n F, 


and 


P(h)F = P{q)F. 

Because of the linearity of our system, we have 

~ y(t, g 2 ) - y(t, gi) j = P(h)y{l, g 2 ) - P(h)y(t, gQ 


(67) 


pm [= 


q. 2 ~ 2i 


22 — gi 


( 68 ) 
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If y is a continuously differentiable function of the parameter g, 
\vc find in the limit 

For the moment we disregard all boundary conditions and seek 
solutions of the reduced equation 


P(h)y = 0. 

First we establish a lemma: If q is a root of multiplicity r of 

P(a) - 0, 


so that 


then 


P‘(5) = 0, 
P*(S) * 0, 


(t «* 0, 1, • — 1) 


(70) 


PW^F(l,i)= 0. (* = 0, !.•••, ,-l) (71) 

d k 

where the last expression is short for [P(/t) F(t, g )] 7 _7 
Actually 


m = ^ mw. ?) = |: P’ww 01 

= 5) + ^ f(l. J) + ■ • • 

= 0, (fe < 1 ) (72) 


term for term because of (70). 

Of course, any linear combinations of special solutions of this 
form arc also solutions. Every polynomial has n roots, when re- 
peated roots are counted according to their multiplicity; and every 
distinct root yields as many special solutions of the form given in 
(71) as the multiplicity of the root. It follows that we have 
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special solutions, not yet shown to be all linearly independent. 
We take the most general linear combination of these 

d rt m l—l 

y = Fit, gi) + Cu F{t, qi) - f b C ltm _i gi) + 

CioF{t, g 2 ) + C 2 i ^ F(t, g 2 ) d 1- C 2jW2 _ 1 ^ m2 _j F(i, g 2 ) + 


C r oF(t, g r ) + C r i ^ g r ) d b CV.m r -l g^n-1 <7r) 

= Z Z 1 Cnj^F&q,), (73) 

t=i y=o og J 

where (gi, g 2 , • • • , g r ) are the distinct roots of P with respective 
multiplicities (mi, m 2 , • • • , m r ). This linear combination is a solu- 
tion of the reduced equation. It remains to determine the n co- 
efficients (C 10 , • • •, C r , m - 1 ) so as to satisfy the » arbitrary initial 
conditions in (56). Thus 

¥ Z Z' C kj F(/, g fc ) = 6,-, (7 = 0, 1, • • • , n - 1) (74) 

t=i 7=0 oq 7 

or because of (66) and (69) 

Z "f 1 c hj — [g fc ^(/o, g,.-)] = h. (7 = 0, 1, •••,« — 1) (75) 

t=i 0 og 5 

From (64), F(£ 0 , g) is unity regardless of g, so we have 


Z Z 1 a-y(g t i ) 0 ' ) =6/, (7 = 0, 1, 1) (76) 

*=1 1=0 


where 



(* = j) 

(7 < j) 


( 77 ) 
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When all of the roots arc distinct the determinant of this 
system takes the familiar Vandermonde form 


1 t 


1 

?» 


II (9. - 9j). 

<>j 


(78) 


which does not vanish Consequently the n equations can be 
solved to yield the n C s so as to satisfy the n initial conditions 
The proof of the identity (78) follows from the fact that the de 
terminant is a polynomial of the (« — l)th degree in terms of any 
q„ and vanishes for any q, equal to q. This plus the essential 
symmetry in terms of the g’s leads to the right hand side of (78) 
An alternative proof would use induction 

In the general case of repeated roots the determinant becomes 

t 1 0 Ot 0 1 01 

1 9 g 0 9' 0 


2 ?. 

0 q,t 

0 

g* 

0 


(qi) > 

(?!*) * " ?t 


g 

(g,)» l) 

<»> 

(h-IV* 

(q » 1) *•> » « 

(g« i)-r. 

g” 1 

(g - l ) ■ o 



no two q s being equal This generalized Vandermonde determt 
nant can be shown not to vanish 10 This may be proved by indue 
tion but for the present purpose we may omit the proof of this 
fact, and simply note its consequence all of our constants can be 
uniquely determined 

Here I should like to point out that the inversion of a general 
ized Vandermonde matrix can be done once and for all, and the 
result is applicable against varying initial conditions The exact 
form of the inversion is expedited by synthetic division and is 
closely related to contour integrations and the expansion of a poly 

'•PA Samuelson A Simple Method of Interpolation Proceedings of the National 
Academy of Sciences XXI\ (1943) 397-401 Also F R Moulton Differential Eqw 
is (New York 1930) chap xv 
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nomial in partial fractions, so that the classical and operational 
methods meet. 11 

26. I turn now to the problem of determining the unreduced 
special solutions with vanishing initial conditions. In the proof 
of our fundamental existence theorem we have seen one way that 
this can be built up out of the basic solution of the first order 
equation F{t, q, 0 ;f) h . Once the latter is tabulated or catalogued 
for a great variety of the /’ s that arise in practice, the remaining 
work is simply that of calculating 

F(t, q u 0 ;f) h = Z u 

F{t, q u q 2 , 0 -J) h = F{t, q 2 , 0; Z,)* = Z 2 , 

F(l, $1, qt, qz , 0 ;/)a = F{t, q 3, 0; Z 2 )h = Z 3 

(80) 


Fit, q u qz, • ■ ■, q n , 0 ;/) A = F(t, q n , 0; Z„_i) A = Z n . 

The last is the required solution. 

In this section an alternative procedure is described whereby 
our special solution with vanishing initial conditions is built up out 
of the first order solution, F{t, q,0;f)h- First, let us recall the 
algebraic fact that any rational expression of the form 


1 


P{x) 


II (x — q t ) m ' 


(L m, = n) 


(81) 


can be split up into partial fractions of the form 

1 A\\ . A 12 , , Ai mi 


P\x) (x - qi) + (x - qi) 2 


+ 


+ 


+ 


+ ••• + 
Arl 


(x - q x) mi 


+ 


+ 


(x - q r ) T 


(82) 


(x - q 2 ) n 1 (x - q T ) 

In the usual case where all the roots are distinct, the coefficients 
are simply 1 /P'{q,), respectively. When we have a repeated root, 
the coefficients are given by 


1 [ d m ~ ] (x - g,) OT - ~ 1 

Ati ~ (m t - j)\\_dx m -> P(x) 


(83) 


11 P. A. Samuelson, "Efficient Computation of the Latent Vectors of a Matrix,' 
Proceedings of the National Academy of Sciences, XXIX (1943). 393-397. 
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problem. After substitution into the nth 

p(k)y = £ R A,i (, ‘ _ 


= W) 


order equation we have 
■ aiVFjit, Si, 0;f) h 

(90) 


by virtue of the identity (84). 

Note that our discussion of negative operators was purely 
heuristic and not germane to our proof. The expression in brackets 
does not involve negative operators although notationally it appears 
to do so. 

This completes our second construction of the unreduced equa- 
tion with vanishing initial conditions. 

27. The elementary mathematical literature on differential 
equations abounds with trick methods for arriving at particular 
solutions of the unreduced equation for special /' s (e.g., poly- 
nomials, exponentials, trigonometric functions, etc.). At this 
point it is only necessary to discuss two such methods. 

The first is a formal series expansion, of even greater importance 
in the field of difference equations than differential equations where 
convergence is not always certain. Let us write the formal 
expression 


1 1 

P(x ) ao + ape + * • • + OL n x n 


— @0 4 " @ix + @°x 2 + • • • . 

(a 0 3^ 0) 


(91) 


Multiply through by P(x) and equate coefficients of like terms to 
get equations determining the infinite sequence of j3’s in terms of 
the a’s. . 

~ ~ 1 

„ (aiPo) 

„ (a 1P1 + a 2 P 0 ) 

& to ’ (92) 


ft 


(aift-l + <*2ft-2 + • • • + atffoj 
ao 
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In practice there are a \ancty of ways of determining the 0 $e 
quence from the as (algebraic division, difference equations 
Taylor's expansion of 1/P(x), determinants, etc ) 

Formally, a particular solution of 
P(h)y = / 

is given b> the infinite series 

' = Jw 1 = (93) 

where 1 fP is now defined differently from (fc6) and (S7) Whether 
there is any mcinmg to the right hand side that is whether the 
senes is defined and convergent or not depends upon the the /, 
and the a's Thus 

C cxxD + I)v = / (94) 

has i finite convergent solution for /equal to t* of the form 

y “ 1/ “ + "SD V - + + ( - «.)'/d/ 

= ~ a,2t* + a, *4/ - «, s 4 + 0 - 0 + { ’ 

While for / equal to c* we have a formal senes 
3 = E (-«,)•£<“ 

= (96) 

a 

which converges for |«i\| less than one, but seems to diverge 
otherwise However, if vve use the "formal” identity 

< 97) 

wc get the correct answer, c“( t + although not always b> 

.1- higttMmate ijntJi. 

Let us write a difference equation in the form 

«.y(<) + «.>(< - 1) + + o.y(/ - n) = P(C-')rtt) - /(/). 

£>(0) = >(0) = 0, 

£-‘y(0) = 0, (98) 

0 ) = 0 

m = 0 , 


(« < 1 ) 



Then 
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iv(0 P(E~ i)-^) ~ ^ J /(0- (99) 

This will always terminate after / terms, and will satisfy the equation 
and initial conditions. 

Exercises: 

1. Show that /3 ( is a solution of the equation 

P(E~ l )y = a a y(t) + cay(t - 1) + • • • + a„y{t - 11 ) = 0, 
y(0) = 1, 
y(- 1) = 0, 

y{- n + 1) = 0. 

2. If P(x) has n simple roots, qr\ and R(x) = a 0 ~'x n P(x~ 1 ), show that 

= ? R’(qd (gi) ' +1_ ”- 

3. If a, equals a‘/t\, what is /3<? (Hint: Consider series expansion of e° 
and l/e“.) 

4. Write in determinantal form. 

5. Write out the first three /3’s in terms of a’s. 

28. The second artifice for arriving at a particular solution 
comes into use when/(/) is itself the solution of a reduced first order 
equation. That is 

P{h)y = f 

where 

hf = qf. (100) 

We try a solution in the form 

y = cf, (101) 

where c is a constant. We have after substitution and cancellation 
of/ 

cP(q) = 1. (102) 

If q is not a root of P, this has a solution, 

C = WY 


( 103 ) 
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In the important electrical engineering studies of frequency re 
sponse, c is the transfer function, for q a pure imaginary, it is an 
impedance 

If q is a root of P, we ha\c the phenomenon of resonance, and 
we must try for our solution 


„ _ cd~m 

' dq" ’ 


(104) 


where m + 1 is the multiplicity ol the root q 
and cancellation we find 

1 

c “ e-(q) 


After substitution 


(105) 


If / is composed of two or more components, ft, ft, , which 
arc each solutions of the first order reduced equation with q equal 
to q\,qt , by superposition we find a special solution of the 
unreduced equation to be 


y 


pL) /l + P(?.) /,+ 


J106) 


If any of the g’s arc roots of P, the modification given m (104) will 
of course be in order 

Important examples arc provided periodic driving forces in 
the field of differential or difference equations Any well behaved 
periodic function can be written as a Fourier scries, of which each 
term satisfies a first order differential (difference) equation with 
pure imaginary (complex root of unit modulus) coefficient Unless 
perfect resonance occurs, as it cannot in a damped system, our 
solution can be built up term by term of the same harmonic ele- 
ments but with coefficients differing by the factor, \/P(q,) The 
resulting solution (if it converges) will be a periodic function of 
time of the same penod as the driv mg force 

Finally, if our f{t) can be written as a general integral 

m = fw.gwnq), ( 107 ) 


where 


MV. q) = qF<J, q), 
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and c represents some contour in the complex plane, then we try 
the formal solution 

y = f c F ((, q)~jdW(q). (108) 

In cases where P(q ) vanishes, this may be an improper integral 
whose limit nevertheless often provides a satisfactory solution. 

The mathematician will recognize that in the field of differential 
equations, all the /’ s that are likely to arise in practice can be 
written in the form (107), by setting q equal to ia, and F equal to 
e ial , and using Fourier’s integral theorem. 

29. The artifice of the Laplace transform will now be general- 
ized to a wide class of simple operators other than D. The Laplace 
transform is defined as 

&{q',y ) = I e~ qa y{a)da, (109) 

Jo 

which will be convergent for a wide class of y's, for the real part 
of q less than some constant. 

The fundamental property of the Laplace transform which 
makes it a useful artifice in differential equations is as follows : 

£(?; Dy) = q&{q\y) - y(.0), 

£(q;D 2 y) = q 2 £(q;y) - qy{ 0) - Dy{ 0), 

^ (HO) 


£(q; D n y) = q£{q\D n ~ l y ) — D n y{ 0) 

= q n £(q;y) — g n-1 y( 0) — q n ~ 2 Dy(0 ) — • * • 

— gIT -1 3 ’( 0 ) — D n y( 0). 

For then taking the Laplace transform of both sides of 

P(D)y = t diD^y = /(*), 

0 

D k y(! 0)=h, (k = 0, 1, ••*,»- 1) 

we get 

P(q)£(r,y) = £(2;/) + &o(a<>2 n-1 + a & n ~ 2 H a »-») 

+ &x(Oo2 n_ 2 H h ®n-2) H 

-j- 671—2(^02 -f - ®i) ~h bti—ido. (112) 


(111) 
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This is an expression in q and involves no operators Solving 
for the expression involving the unknown function, we have 

(U3 > 

where R is a polynomial involving the initial conditions, b k , and 
is less than n in degree Because / is known, the Laplace trans 
form of our unknown function is determined Under general 
mathematical conditions we can prove that our solution y is “essen 
tially” unique once its Laplace transform is determined One 
method of inversion is given by contour integration 

y (<) - 2«X_i < 114 ) 

where the contour is to the right of all poles of f 

In any case, once P(g) is determined, the determination of y(Z) 
is a detail which need not concern us Extensive tables are avail- 
able to aid in inverting Laplace transforms involving a wide class 
of f's 

To extend the theory we need only seek a junctional appropriate 
to any admissible h, with the fundamental property 

£ (g,hy) h = q£.(q,y) h - y(t 0 ) (115) 

The following expression formally answers our problem 

*■*-!= %#& • (116) 

where q and / are such that 


m 


'F(t,q, 1.0)* 


(117) 


The proof that this is our generalized Laplace transform is given 
Ofo Whowb 


By definition 


■C(?. hf) * = hm 


F(t,g r 0,hfh 
F{t, q, 1,0)* 


(118) 


(A - q)F(t, q, 0 ,/)„ = J, 
(A — q)F(l, q, 1,0), = 0, 
(A - q)F(l, q, 0, A/), = hf. 


F(lc, S, 0,/), *= 0 
F(!„g, 1,0), = 1 (119) 

File, q, 0, A/), = 0, 
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and each of these systems has unique solutions. Now it can be 
verified that 

F(t, q, 0 ;hf) h = hF(l, q, 0 ;/) A - f(t 0 )F{t, q, 1; 0),„ (120) 

because 

hF(t 0 , q, 0 ;f)h - f(lo)F(t 0 , q\ 1 ; 0)* == f(t 0 ) - f(t 0 ) = 0, (121) 

and 


(h - q)\JiF{t, q, 0 ;/)ft - f(t 0 )F(t, q, 1 ; 0) A ] 

= h[_(h - q)F(t, q, 0 ;/) A ] - f(t 0 )(h - q)F(i, q, 1; 0)* 

= TO - 0. (122) 

Therefore, 





hF(l, q, 0 ;/)/, 
F(t, q , 1 ; 0) A 



_ . Wt, g, 0 ;f)h , f(t) 

K 5 F(l, q, 1 ; 0), + F(t, q, 1 ; 0)„ 

— g£ (?;/)>< + o — m 


M 


(123) 


for q and / limited as in our hypotheses. Q. E. D. 

Thus, the solution to the general nth order problem with pre- 
scribed boundary conditions is given by the inversion of 


or formally 


, , R(q) . 

£(Q>y)h P(q) 




P(q) 


+ £~ 


= Y{q) hl 

<fi(g;/)* ~l 
P(q) J 


(124) 

(125) 


Moreover, even where there is no convergence of £, “£” can be 
“inverted” as the mathematically inclined reader may work out 
for himself. So long as /(/) leads to a unique solution of the first 
order operator equation, its behavior at infinity is irrelevant. 


Exercises: 

1. Show that £{q ; y)E = H — Y[q)s, where to = 0. 

o 2 

2. What is £(q; y)p What is £{q\ y) id? 
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3 Make and verify a table of transforms for E as follows 


J(l) 1 

F(?)e 

a* — F(t a 1 0)* 

i 

2 - a 

/a* 1 

(^r 

tU - 1) (t — it + l)a‘ » 


»' 


F(l a 0 yi)» 

LMa 

q-a 

£ yi(*)y»(/ - 1 - *) 

0 

?Aq)E? t (q)E 

d*y(< a) 

d*Y(g a)h 

da" 

da" 

! 



4 Trace the relationship between the classical and the generalized 
Laplace transform method 


Many Variable Systems 

30 Let us return now to linear systems of many variables 
(yi yi y m ) For the dynamic system to be determinate, we 
must have at least nt operator equations These can, of course, be 
of high order, say n or less and can be written as 


Go i\h n yi -f Go i zh n yi -f 

Go 2 \ h n yi + Go 22 h n yi -f- 


+ oi lift" l yi 4- 

+ On i-y« = MO, 

+ a, uk’-'y, + 

+ a. i,y„ = /i(f). 


(126) 


Os mih n y, + a 0 + -f ai 4 

+ On - /-(<), 

or as 

Pn{l>)y, + Pn(h)y , + + Pi-W?. = MO. 
Pn(ft)yi + Pu(ft)y, + + Pt m {h)y„ = MO, 


Pm >(ftbi + P„ j(ft)j! + 


+ P-»(ft)y. =■ MO, 
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where Pa{h) are polynomials in h of the nth. degree or less; or in 
matrix notation as 

a 0 h n y + aji n ~ l y + • • • a n --Jiy + a n y = /(/) 

and 

Py =/, 

where (a 0 , ■ ■ ■ , a n ) are in~ matrices, y and / column matrices of in 
elements. Of course, many of the elements in the a’s may consist 
of zeros. 

The maximum order of our system will be nm; but its actual 
order may well fall short of this. By the actual order of the system 
we mean the maximum number of initial conditions which can be 
arbitrarily imposed on the system. 12 

As an exercise in matrix notation the reader may work out the 
demonstration that by redefinition of variables 13 the system can 
be written as a first order one, 

A 0 hY+A 1 Y = F, (127) 

where 


I 

0 

0 

0 

0 

I ••• 

0 

0 

0 

0 ••• 

0 

I 

0 

0 ••• 

0 

a 0 


0 

-I 

... 0 

0 


'O' 

0 

0 

... 0 

0 


0 

. 

. 

... 

■ 

, F = 

. 

. 

. 

... 

• 


- 

0 

0 

... 0 

-I 


/. 


&7I—1 

... 

a x 



12 It can be shown that the order of the system, s, is equal to the degree ih 5 of 
the determinant 

A (g) = |ao2 n + oig" -1 + • • ■ + a »l- 

See R. A. Frazer, W. J. Duncan, and A. R. Collar, Elementary Matrices and Some 
Applications to Dynamics and Differential Equations (Cambridge, 1938) for references. 
See par. 5.4. 

13 Frazer, Duncan, and Collar, Elementary Matrices, par. 5.5. 
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Although our system has been converted to a first order systei i, 
it is not yet in “normal” form with h Y expressed explicitly in terms 
of Y and functions of time 

hY = AY+f (128) 

If A 0 is non singular, that is, if its determinant does not vanish, 
we can premultiply both sides of (127) by Af 1 and immediately 
convert to normal form 

hY = — (A 0 ~ l Ai)Y+ ( A*~ l F) (129) 

Similarly m (126) if a 0 is non singular, our operators of highest 
order can be solved for in terms of those of lower order by the act 
of premultiplication with Oo - ' 

If a o is singular our system is not well specified One of three 
situations must then prevail (1) there exist purely statical, algebraic 
relationships between some of the variables By means of these 
relations one or more variables can be eliminated, and the leading 
matrix of the resulting smaller system will then be non singular 
(2) the relationships are not all independent so that a unique solu 
tion does not exist Arbitrary functions of time will enter into 
the solution (Example A system with two identical equations, 
permitting one variable to be chosen as an arbitrary function of 
time) (3) the system is inconsistent 

31 It is up to the economist to present a well specified dy- 
namical system In what follows we shall assume that this task 
has been adequately performed so that whenever desirable our 
system can be written in the normal first order form of order r 

hy = Ay + f, (130) 

with initial conditions 

y(t 0 ) = b 

As m previous sections we shall find our solution in two steps 
first, the solution of the reduced or homogeneous system 

hy = Ay, (131) 

y{to) = b 

and then the solution of the unreduced system 
hy = Ay +/, 
y(/o) = 0 


( 132 ) 
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The sum of these two solutions will be the solution to our general 
system (130). 

32. Earlier we have seen that the solutions of the simple first 
order equation in one variable provided the building blocks for 
the solutions of more complicated cases. We try, therefore, a 
solution of the form 

yi = CtFit, q, 1; 0) A = c x F(t, q), 


(133) 


y, = c s F(t, q, 1; 0) A = c a F(t, q), 

where F is as defined in earlier sections, and the e’s and q are un- 
known constants. After substituting into (131), we get by means 
of the fact that hF equals gF, 

qc\F — A\\C\F -f- A\ 2 c 2 F -f- • • * Auc,F, 
qc 2 F = A 2 \C\F -}- A22C2F +•••-}■ A 2 3C,F, 

(134) 

qc s F = AgiCiF -}- A 32 C 2 F -j- • • • T A„c,F. 

The common factor F may be canceled from both sides of the 
equation, leaving us with s algebraic equations which do not involve 
time. If these equations can be satisfied, we shall have a solution 
of the operator equation, although not necessarily the solution 
with appropriate initial conditions. 

Our linear equations involve known A’s, undetermined c’s and q. 
Our equations are homogeneous in the c’s and may be written as 

(A 11 — g)ci -f- A 12C2 + • * • + AuCt = 0, 

A21C1 T (A 22 — s ) c 2 "T * ■ ’ "T A 2 sCe — 0 , 

(135) 


AslCi -f- Ae2^2 + * ' * H" (At! q) c > 

If all the c’s are zero, we have a trivial solution of the operator 
equation. When can we find a non-trivial solution? From the 
theory of ordinary linear equations (Cramer’s rule, etc.), we know 
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that a system of homogeneous linear equations has a non trivial 
solution if, and only if, its determinant vanishes, in this case if 

.<4 11 — 5 A it ^4i« 

■<4*i -<4jt — 2 Au 

A(g) = 

A,i A, • A„ — q 

*= *ro2* 4- jrig* -1 + 4- irj-ig 4- *r, = 0 (136) 

For a q chosen at random the determinant will not vanish, and 
we shall ha\e no non trivial solution But g is at our disposal, we 
select it to make the determinant vanish The determinant is a 
polynomial of the sth degree 1x1 g, and is called the characteristic 
equation of the matrix (or secular equation) The roots of the 
characteristic equation must necessarily be 5 in number (counting 
multiplicities) They are called characteristic or latent roots of 
the matnx If we designate them by (gi , g t ), we can wnte 

A(q) » (21 - 2) (2* ~ 2) (2. - 2) *= 0 (137) 

It is not easy in fact to find the characteristic roots of a matrix 
From the definition of a determinant, the coefficients of the charac- 
teristic polynomial can be determined as the sums of principal 
minors of a gnen order (cf Mathematical Appendix A, p 374), and 
any of the customary methods for solving a polynomial for its 
roots may then be employed Ev en w ith the best short cuts, 14 the 
computations are tedious Fortunately, the economist does not 
always require the quantitative values of the roots 

Let us return to the problem of finding solutions to our reduced 
operator system If we put a characteristic root, q„ into the F m 
(133), and take for our c's a non vanishing solution of (135), we 
shall have a solution which satisfies our reduced system Note 
that our selection of c’s is not unique, doubling each will also give 
us a solution 

A column of c’s satisfying the homogeneous equations is called 
a characteristic vector or latent vector or sometimes a modal column 
or normal component Some of the characteristic roots may be 

“ Frazer Duncan and Collar, Elementary Matrices chaps iv and v 
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complex; the characteristic vectors will then be complex. Of 
course, conjugate complex magnitudes can always be combined so 
as to give only real quantities in the final answer. 

If a root qi is simple, then A '(g f ) is not equal to zero, and the 
rank of [ A — g,T] is (s — 1). Consequently, the latent vector 
corresponding to g,- is unique except for scale factor. We may dis- 
pose of this ambiguity by some normalization convention. 

If all roots are distinct, the general solution can be simply 
written. Where roots are repeated, certain complications occur. 
However, these are not intrinsic and rarely concern the economist. 
Later, I show how these difficulties can be avoided. But for true 
mathematical insight the reader must be referred to the theory of 
elementary factors and divisors. 15 

If the 5 roots are simple, and the 5 latent vectors have been 
normalized, we have the following particular solutions of the re- 
duced equation (131): 

Ci,iF(t, gi); c U2 F(t,q 2 ); ••• c lf ,F(f, q,)i 

c 2 ,i F(t, qi) ; c 2 , 2 F(t, q 2 ); - • • c 2i ,F(t, q.) ; 

: : : 

c»,iF(t, qi ) ; c., 2 F(t, q 2 ); • • • c,,,F{t, q.), 

where F(t, q) is an abbreviation for F(t, q, 1 ; 0)^. Any linear com- 
bination of these is a solution, or 

3 V = KiCj'iFlt, qi) -f- K 2 cj, 2 F{t, q 2 ) + * * ■ d - K s Cj,,F(t, g»). 

(7 = 1 , •••,*) ( 139 ) 

Note that the c’s and q’s depend only on the matrix and not on 
the initial conditions. But the K’s must be determined by the 5 
initial conditions. Thus, 

yi(0) = KiPui K 2 C\, 2 T K s Ci'S = bi, 

^2(0) = K 2 c 2 ,i “h K 2 c 2 , 2 -{"■'■ d - K t c 2 ,s = b 2 , 

; (140) 

y e ( 0) = Kic s ,i + K 2 c s , 2 4- • • • 4- K,Cs., = b„ 

u M. B ocher, Introduction to Higher Algebra (New York, 1933). 
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cK = b. 

These equations are sufficient to solve for the K’ s in terms of the 
6’s if the matrix c is non-singular. The reader may be referred to 
any mathematical text for the proof that C is non-singular, and that 

c-Mc = 18 (141) 

In the case of distinct roots, the solution to the reduced system 
(131) can be written in compact form as 17 

y = c[_F(t, (142) 

33 If b is the identity matrix, we have a fundamental set of 
solutions This may be written formally as F(t, A), either as an 
extension of Sylvester’s Law of Latency mentioned in Mathe- 
matical Appendix A, footnote 13, or from the definition 

F(t,A) = imMA*, (143) 

0 

where the m’s are defined by the formal identity 



F(l. s) = £ Wi(0s‘- 
0 

(144) 

Because of the property 



hF ~ qF, 

(145) 


h £ «j(02‘ = £ w,(flg‘ +I . 

0 0 

(146) 

and we have the formal identities 



hm a (t\ = 0, 

hm,\t) = bii-jW. (• = 1. ■ • ), 

(147) 

and hence 

hF(f, A) e AF(t, A) 

(148) 

Also 

F(t„A)mI. 

(149) 


Formally, these above relations hold even if there are equal 
roots. The formal solution given in this section is sometimes a 


11 In Mathematical Appendix A this fact » used in connection with symmetric 
, for which e is orthogonal, i e , r* ■ c 1 
,T Frazer, Duncan, and Collar, Elementary Matrices, chap v. 
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very convenient one. For the differential equation system 

Dy = Ay, (150) 

F(t, A, 1; 0 )x> takes the form 


F d = e Al = I-f + (151) 

For the system 

Ey = Ay, (152) 

we have 

F(t, A, 1;0) B = A‘. (153) 

However, for stability investigations these solutions are not very 
convenient. 

34. If instead of the roots all being simple, some are repeated, 
we may write 

Ms) - (Si ~ s) mi (S 2 ~ S) mi (Sr ~ s) m ', (154) 


where (gi, ■ • • , g r ) are the r distinct roots with respective multi- 
plicities (mu • • •, m r ). 

As before, there are special solutions of the form, C{F(t, g x ), but 
the c’s are no longer unique except for scale factor; in fact, even if 
the c’s were uniquely defined, there would not be a sufficient num- 
ber of particular solutions to combine to get a general solution for 
5 arbitrary initial conditions. 

Earlier, in the case of multiple roots we tried solutions of the 


form, 


dF(t, g { ) d 2 F(t, g x ) 


d m ~ 1 F(t, g { ) 


dg ’ dg 2 ’ ’ dg m( 1 

Let R(g) be the adjugate of the g matrix, 

P(g) = A - gi; 


, and so we shall here. 


(155) 


that is, R(g) is the transpose of the matrix made up of the cofactors 
of P(g). 

By means of algebraic theory of no interest here, it could be 

shown 18 that from the columns of the matrix £ — can be 

selected vectors which when multiplied by the partial derivatives of 
F will give a complete solution. However, these difficulties can be 

18 Frazer, Duncan, and Collar, Elementary Matrices, par. 5.10. 
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side stepped by the device of reducing the s first order equations in 
5 different variables to single 5th order equations in each variable Is 
Thus if 

hy ^ Ay, (156) 

y(i a ) - b, 

then 

hy(t 0 ) = Ab 
h'yito) *= A'b 

(157) 


h-'y(t 0 ) = A^b 

Because of the Cayley Hamilton Theorem that every matrix satis 
fies its own characteristic equation or 

A(A) e 0, (158) 

we find from (156) that 

A (h)y = (to h' + ti h— 1 + -f T,)y “ A(/l)y => 0 (159) 

Hence, each variable say y,(t), satisfies an jth order equation We 
also know from (157) the values of Cyy(M> 0 )] 

Hence, we have a well determined system whose solution can be 
found by the methods of sections 24, 25, and 26 It is clear then 
that our final solutions take the form 


y>(<) - c, nF(t, q.) + C, „ + +c, a "‘ 


+ c j nF( l, gi) + Cj S j 


a s 


+ + Cj imr-i 


dq- 


(160) 


+ c, ,i F(t, j r ) + c, „ 


ant. !') 

dq 


+ +c,™- 


d n, ~ I F(t, gr) 
Ss”' _l ' 
0 = 1 . .-*) 


** The method of attack suggested here is not only theoretically simple but is also 
computationally optimal See P A Samuelson Efficient Computation of the Latent 
Vectors of a Matrix Proceedings of the National Academy of Sciences XXIX (1943) 
397 
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where the s 2 c’s are not all independent. All but 5 are determinable 
from the matrix above. The remaining 5 depend on the 5 initial 
conditions. 

Thus, terms of the form dF k / dq k may appear if there are multiple 
roots. But they need not appear even if there are multiple roots. 
That is, whatever the initial condition b, the coefficients c s , ik may 
be identically zero for (1 < k < m t ). For example, in the matrix 

A = C“ 35,/], (161) 

— 3 is a repeated root of multiplicity s. F(t, — 3) appears in the 
final solutions, but not dF(t, — 3)/dq, or any higher derivatives. 
The reader must again be referred to the theory of elementary 
divisors for insight into this possibility. 

As far as stability is concerned, except for borderline cases 
(which we have excluded in our strong definition of stability), it is 
clear that 

lim F(t, q) = 0 

implies 

lim ^ = lim lim g + ~ = lim 0 = 0. (162) 

/—♦co (sff A£— K) ^2 Aq—*0 

Therefore, it does not matter whether or not the partial derivatives 
of F are suppressed. 20 

35. We have now given the complete solution of the reduced 
equation. By the same method as the last section we may indicate 
the solution of the unreduced part. 

hy = Ay + f, (163) 

y(t 0 ) = 0. 

10 A borderline case on the boundary between stability and instability is provided 
by a symmetrical conservative physical system. Here F(l, iq)D = e ' qt ar, d is neither 
damped nor undamped. However, dF/dq = ile'v 1 is explosive, as is well known from the 
theory of resonance. But the autonomous reduced system y = Ay will not have explosive 
solutions when there are multiple roots; for A = A the theory of elementary divisors assures 
the vanishing of coefficients of all terms of the form ( il) k e i & so that only an external Jorce can 
cause explosive resonance. [Added in 1951 : L. J. Savage ( J ournal of Poliltcal Economy, C , 
1948, p. 202) has correctly pointed out the invalidity of this double-limit argument, 1 us- 
trating with h = §(exp- 2t) X (D + 1) and F(t,q) = exp ( - t + q exp 2 1) for imaginary 
?•] 
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We easily see that 


h°y = y, 
hy Ay + f, 

h 2 y = Ahy + hf — A 3 y -f Af + hf, 
h*y = A'y + A*f + Ahf + h*f, 

(164) 


h-'y = A'-*y + A‘~*f + A‘~ 3 hf + • • • + Ah'-*/ + h'~ 3 f, 
h’y = yl*y + + • • • + Ah*~ 3 f 4- A' -1 /. 

Multiplying the first line by ir„ the second by v t -j, the last by ir 0 , 
and adding, we have 

A(k)y = A(A)y + vi»f + mjif + • ■ • m,- y h‘~ l f, (165) 

where the first term on the right-hand side vanishes by the Cayley- 
Hamilton theorem, and where the m's are linear combinations of 
powers of A. 

Each of the variables (y u • • •, y n ) must satisfy an unreduced 
sth order equation. The right-hand driving functions seem to 
depend upon hf, h*f as well as /; and yet, we have nowhere made 
the assumption that expressions of the form h k f need even be de- 
fined. If we write out a typical equation, say in y u the paradox 
will be dispelled. We have 


or 


W)yi - RiW. 


(166) 


-g.w 

■ A{h) * 


Because Ri is of degree (s — 1) at most, it will be clear that 
(by partial fraction expansion or otherwise) we can avoid all terms 
of the form h k f. Only expressions of the type F(t, q,b;f)h will 
appear. 

We have now only to specify the correct initial conditions. 
Recause 


A*y(f 0 ) = 0, (fe=0, !,-••) 
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our initial conditions should be 

hy(l B ) =/(/ 0 ), 
h 2 yOo) = Afit o) + hf(lo), 


( 167 ) 

h'-'yVo) = ^«- 2 /(/ 0 ) + A-Vifilo) + ••• 

+ Ah'-Zj{l 0 ) + h*-*f(to). 

We now have an unreduced 5th order equation with prescribed 
initial conditions which each variable must satisfy. Our problem 
has therefore been reduced to that of earlier sections, which has 
been completely solved. 

Exercises : 

1. Show that C e A(, ~ ,0 ~ w> f(w)dw upon proper definition becomes the solu- 

«7( 0 

tion of (132), when h is D. 

i-i 

2. Show that £ performs the same role for h equal to E, l B — 0. 

3. By using the generalized Laplace transform £(q;y)h, defined earlier, 
go from 

ky = Ay + f, 
y(to) = b 
to 

]w( 9i /)0 + [|f ]s, 

where [Tii(!z)/A(g)]] equals [_ql — cf] -1 and consists of rational functions whose 
numerator is of lower degree than denominator. Show that expressions of the 
form hi fit) do not enter into the solution in an essential way. 

36. We have already seen in Section 21 of this Appendix that 
it is necessary and sufficient for stability of a differential equation 
system with constant coefficients that the real parts of the roots 
of the characteristic polynomial should all be negative, i.e., the 
roots must all be in the left-hand half of the complex plane. Other- 
wise we should not have the required dampening factors in our 
exponential solutions. 

Fortunately, it is not necessary to solve the characteristic equa- 
tion for its roots to determine whether these stability conditions 
are satisfied. There exist known simple necessary and sufficient 
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conditions on the coefficients of the polynomial which guarantee 
stability These are known as the Routh Hurwitz conditions 21 
Let our general polynomial of the nth degree be written as 

/(X) - aoX * 4- aiX*-' + + o„_,X + a* - 0, (168) 

where ao can always be regarded as positive and will often be taken 
to be unity for convenience 

If the a coefficients are all real, it is a familiar fact that such 
complex roots as occur will appear in conjugate pairs, (u 4- tv) and 
(u — tv) If we write the polynomial as the product GoII (X — X<) 
these complex roots will combine to form real quadratic factors 
It follows, therefore, that every polynomial can be written as the 
product of quadratic factors and linear factors 

f(X) ~ a,{X -f m»)(X + mi) (X + «,) 

(X* + b t X 4- c,)(X* 4 - b s X 4- ci) (169) 

The linear factors can be grouped into quadratic factors except 
when n is odd, in which case / can be written as a product of 
quadratic factors and at most one linear factor 

Now the roots of / cannot be “stable/ unless the roots of each 
linear and quadratic factor are stable We must investigate then 
these simple cases, which can fortunately be treated exhaustively 
m a simple way The linear factor 

A 4- Wi =** 0 

clearly has a Routhian root (one whose real part is negative) only 
if its coefficients are of the same sign For — nil is its root, and 
mi must hence be positive 

A quadratic factor of the form 

X 1 4- bX 4- c = 0 
has roots which can be written as 


X = 


- b ± ^ - Ac 
2 


(170) 


n E J Routh Stability of Gtven State of Motion (London 1877) Adams Prize Essay 
Dynamics of a System of Rigid Bodies (6th ed London 1930) part II chap vi A 
1 witz Ueber die Bedingungen unter welchen eine Gle chung nur Wurzeln nit 
reellen Thetlen besitzt Malhematische Annalen XLVI (1895) 273-284 
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If b 2 — 4c is greater than or equal to zero, the roots are real and 
the quadratic expression can be factored into the product of two 
linear expressions, ( X + m r ){X + m 2 ). It is obviously necessary 
then that the coefficients b equal to {m 1 + mf) and c equal to niiPio 
should both be positive; for the sums and products of positive 
numbers must be positive. 

We have so far proved only the necessity for the coefficients 
to be positive in the real root case. Are these conditions sufficient 
in the real root case? And what about the complex case? Leav- 
ing the latter aside, we easily deduce that positive coefficients are 
also sufficient to give negative roots. The first term in the numer- 
ator of (170) is clearly negative. If we choose the negative sign 
before the radical, we have one root which is clearly negative. But 
the other root will be positive only if the second term outweighs the 
first, i.e., depending upon whether b 2 is less than b 2 — 4c. By 
hypothesis, c is positive, and it follows that the second term cannot 
outweigh the first. Thus, positive coefficients are both necessary and 
sufficient for negative roots of a quadratic, when the roots are as- 
sumed to be real. 

The complex case is even simpler. If we add the two roots, 
(u + iv) and (u — iv), the complex part cancels out, and — b is 
twice the sum of the real parts. If these are negative, then b must 
be positive. In any case c is positive, or the roots would not be 
complex, i.e., b 2 — 4c is less than zero only if c is greater than zero. 
Therefore, our coefficients must be positive if the roots are Routh- 
ian. This proves necessity. If both b and c are positive and the 
roots are complex, it follows immediately from (170) that u must 
be negative. This proves sufficiency. 

Thus, from our treatment of the real and complex cases, we can 
enunciate the simple theorem : a second degree polynomial has roots 
whose real parts are both negative if, and only if, its coefficients are 
both positive. 

Immediately we can deduce certain necessary (but not suffi- 
cient) conditions which must hold in the general case if the roots 
are all to be Routhian. Each linear and quadratic factor must 
have positive coefficients, and, therefore, all the coefficients of our 
polynomial must be positive. This necessary condition would also 
be sufficient if we dealt only with real roots by virtue of Descartes 
Rule of Signs discussed in every book on algebra. 
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But there is no reason in the general case to exclude complex 
roots.” What if some of the b's arc zero or negative, but in com- 
bining to form the a coefficients the positive coefficients alway 
outweigh the negative ones? Then our a coefficients’ all being 
positive would not be sufficient. A simple example establishes 
this possibility. 

(X + 1)(X‘ - .001X + 1) = X s 

+ .999X 2 + .999X + 1 = 0. (171) 

Note that all the o’s are positive even though the complex roots 
have real parts which are positive (*= + .0005). 

Now that wc are warned, a number of possible modes of attack 
are open to us. We may tackle the third and fourth degree cases 
by themselves, knowing that but rarely will higher degree equations 
be involved. Or we may deduce further, stricter necessary condi- 
tions, hoping that when we have found enough of them they will 
be sufficient. 

Thus, in the third degree case we have 

X 3 + fl,X 2 + <z*X + o, = (X + m)(X* + bX + c) 

= X J + (m + 6)X* + (c + bm)X + me. (172) 

We form the product — never mind why — 

Oja? — a 0 o s = (m + b){c + bm) — mc{ 1) 

= b{c + mb + nz*) + 0. (173) 

If the roots are all Routhian, c, b, m are all positive, from which it 
follows our expression fl 1 a 2 — flocij is positive. 

We now have a new necessary condition, or four in all, 

fli > 0, 

c* > 0, (174) 

a, > 0, 

Olfl2 — CoOs > 0. 

Are they sufficient? If we seek an example which satisfies them 
and yet yields non-Routhian roots, we shall be disappointed. Or 
so the author’s experience suggests. But how can we be sure that 

* If the polynomial in question Is the characteristic equation of a symmetric matrix, 
roots must all be real, and the simpler analysis can be applied. 
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a more clever, a more zealous investigation will not produce such 
an example? Only by a sufficiency proof. 

As a matter of fact, these four conditions will be later shown 
to be both necessary and sufficient. But there is still one hitch. 
They are not all independent. The reader may verify that the 
condition g 2 > 0 can be deduced from the other three. (Can any 
one be deduced from the other three? Which can, and which 
cannot?) It was the great achievement of Routh to have derived 
an easily calculable set of independent necessary conditions. 

As an introduction to what may be called Routh’s test deter- 
minants, consider the 3X3 matrix 


Ci 03 0 

Go G 2 0 

0 Hi flj. 


(175) 


Its three naturally ordered principal minors, Ai, A 2 , A 3 , are, re- 
spectively, fl-i, g 2 g 4 - g 3 g 0 , a 3 (a lC . 2 - a 3 a 0 ), and if they are all posi- 
tive then the necessary and sufficient conditions for a cubic to be 
Routhian are realized. This suggests a general rule for a poly- 
nomial of the nth degree. List its odd coefficients m a row, 
treating all coefficients with subscripts greater than n as zero: 
a a a a • • • . Let each of these elements head a column, whose 
subscripts decrease one at each step. Follow the convention that 
any negative subscript implies that the coefficient is zero. Thus 
we have a square array of numbers, of which only the first « rows 
and columns need concern us. 


a 1 1 a 3 


g 5 ! a 3 


1 

Go 

g 2 

G 4 ; Go 

1 

0 

Gi 

1 

G3 i G5 

6 

Go 

g 2 g 4 

• 

• 

• • 

• • 


(g 0 > 0) 


(176) 


The reader should write out the matrix in lull for an 8 th and 9 th 
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degree polynomial, noting the difference between an odd and 
even n 

Routh’s theorem states that it ts necessary and sufficient in order 
for the real parts of all roots to be negative that the principal minors of 
this matrix all be positive, 1 e , 

Ai « > 0, 

A 2 ** «ta2 — coflj > 0, 


(177) 

A,-, > 0, 

A* = a„A„_i > 0 

The condition that all the o’s are positive is contained within these 
conditions, but not vice versa (except if all roots are real) 

The rigorous proof of this theorem cannot be given here The 
reader must be referred to Routh’s Adams Prize Essay A brief 
sketch of the reasoning may, howe\er, be indicated Let X be 
any point in the complex plane, and/(Z) be a mapping into another 
complex plane Consider any closed contour C in the X plane 
The number of roots of f(X) within this contour can be determined 
by an important theorem of Cauchy, 2 * which is essentially topo- 
logical in nature Traverse the contour C in the positive sense 
(counter clockwise) and consider the mapped contour of f(X) 
The number of roots in C equals the number of times that f{X) 
loops around the origin , or if we write f{X) as P + iQ the number 
of times that the ratio Pf Q passes through zero from plus to minus 
in excess of the number of times that the ratio passes through zero 
from minus to plus 

Routh takes as his contour C the right half complex plane, or 
more rigorously a limitlessly large semicircle to the right of the 
origin and the imaginary axis The changes m sign of P/Q on the 
semicircle are easily evaluated Hence, if there are to be no roots 
m the right half plane, the number of changes in sign of P/Q on 
the imaginary axis are uniquely determined , and the polynomials 
fi{X) and fi{X) must have all real roots which separate each 
other, where 

±f(tX) =MX) + iMX) 

11 See any textbook on complex variables 


(178) 



DIFFERENCE EQUATIONS 435 

By classical Sturmian theory this involves taking the neatest 
common divisor of t\(X) and f 2 (X) to get new functions f 3 (X), 
fii.X'), '• Finally, the conditions on these subsidiary functions 
can be shown to be equivalent to the Routh-Hurwitz test deter- 
minants given in (177). 

Numerically, these determinants can be easily evaluated by 
various Gauss-Doolittle methods. As an exercise the reader may 
verify that the conditions for a fourth degree equation are as 
follows: 

Oi > 0, 

Ol<2. 2 ~ &oG3 > 0, . . 

aia 2 (iz — o 0 ffl3 2 — Oi 2 o 4 > 0, 

a 4 > 0. 


It is worth noting that as we move continuously from a stable 
set of coefficients to an unstable set that a n or A n _ 4 first change sign; 
also that a reverse numbering of the a’s must satisfy the stability 
conditions. 24 

37. We have seen that for a difference equation system to be 
stable it is not necessary that the real parts of the roots of the 
characteristic equation be negative. Rather the roots must be 
less than unity in absolute value. Instead of all lying in the left 
half of the complex plane, they must all lie in the unit circle. 

For first and second degree equations it is not hard to work out 
necessary and sufficient conditions by direct algebraic means. For 
higher degree equations such methods are tortuous. Fortunately, 
by a simple use of complex transformations we can reduce our 
problem to the Routhian problem already given a complete 
answer. 25 The complex transformation 

z = (180) 
converts the unit circle into the left half of the complex plane. 

2< In P. A. Samuelson, “Conditions that the Roots of a Polynomial be less than 
Unity in Absolute Value," Annals of Mathematical Statistics, XII (1941), 360-364, a 
numerical example is worked out and some further facts are noted. If a number of 
A’s vanish, then a further test must be made. 

25 P. A. Samuelson, “Conditions that the Roots of a Polynomial be less than Unity 
in Absolute Value,” Annals of Mathematical Statistics, XII (1941), 360-364. 
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Also 

(Z + 1 \ a D (Z + 1)» + a,{Z + l )->( Z - 1) . 

f{X) =f{z~Tl) sx (Z. - 1). +••• 

(181) 

. o._x(Z + 1)(Z - l)- 1 + o„(Z - 1 y 

+ (Z^ly 

If the polynomial f(X) is to have no roots in the unit circle, 
then the numerator of the right-hand side, regarded as a poly- 
nomial in Z, must satisfy the Routhian Conditions. If we expand 
out this polynomial in the first degree case, we find as necessary 
and sufficient conditions that the root of X + at — 0 be less than 
unity in absolute value; 


1 + a x > 0, 

1 - ai > 0, 


l°il < 1 


(182) 


For the second degree polynomial, X 2 + a x X + oj = 0, the 
reader can show that the stability region of the (fli, a t ) plane is a 
triangle defined by 

1 + fli + fls > 0, 

1 - at > 0, (183) 

1 “ 4* n* > 0, 


It is to be noted that the conditions arc always one greater in 
number than the degree of the equation. This is true in the 
general case of the nth degree equation. 

f(X) = X" + oiX-i + • • • + o.-t* + = 0. (184) 

We form the equation in Z, 


c 0 Z" + ajZ"- 1 + • • . + a„_iZ + o, *= Z a,(Z ~ 1)’(Z + 1)- 


w'here 


= £«< s £Y’ ) z-‘(- 1)‘(” . ’)z — i, 

i“Q t-o j-a \ & I \ 3 J 

(<) = /!(j ■—/)!’ s = t; l =° 


(185) 


s 0, 
= 0. 


JS/; i m o 

s < l 
l < 0 


( 186 ) 
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DIFFERENCE EQUATIONS 
Collecting terms we find 

r. 

Go — Y a ii 

i — 0 

n 

Gi = Y g.(» — 2i), 

i= 0 


Or = 


n n 


i=0 i=0 




(IS?) 


G„ = 1 - Gl -fi ®2 - **' 4- (~ i) n ~ I G-n_i -f- (~ 1)'-G n . 

In addition to the condition, 

Go > 0, 

we have the Routhian conditions 

Ai > 0, 

A 2 > 0, 

(1SS) 


A„ > 0, 

where these are naturally ordered principal minors of 


Gi 

o 3 

G fi • • • 

Go 

c 2 

di 

0 

Gi 

a s ••• 

0 

Go 

c 2 • • • 

; 


* • • • 

• • • • 


The reader may verify that for a cubic equation, 

Go = 1 + Gi + C 2 + Gj. 

di — 3 + G-i — g 2 — 3g 3 , 

0 2 = 3 — Gi — c 2 + 3 g 3 , 

03 = 1 — Gi + G 2 — c 3 . 


(1S9) 
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38 Formally, our stability criteria are complete But for 
most theoretical purposes numerical values of the various coeffi 
cients are not at hand It would be highly desirable to be able to 
infer from the qualitative properties of our dynamical matrices 
even before their characteristic determinants have been (labon 
ously) expanded out whether or not they are stable 

Unfortunately, this is rarely possible Often, however, certain 
very general sufficient conditions can be indicated to guarantee 
that a differential equation is stable — or as I shall term it Routhian 
or to indicate that a difference equation is stable with moduli less 
than unity — or as I shall term it Tinbergeman in honor of Pro 
fessor Tinbergen’s outstanding work with difference equations in 
economics 

The following theorems are listed more or less without proof 

Theorem 1 x — axis stable, if the matrix a is symmetrical and 
negative definite i e , a is a Routhian matrix (Sec Mathematical 
Appendix A, page 19, passim for this classical fact ) 

Theorem 2 If a is quasi definite, so that — is nega 
tive definite, then a is Routhian ^Hint ^ (X'AT) *= X'aX 
= X ’ ^ X < 0 Therefore, Um *,(/) = 0 and roots must 

be damped ^ 

Theorem 3 If a' a — I is negative definite, then a is Tinbergeman , 
i e , X l+ i = aX t is stable, and roots of ] a — XI j = 0 are all less than 
unity in absolute value (Hint Xt+iXt+i = Xt{a'a — T)Xi 
=* A(£ x,*) < 0 by hypothesis Therefore, iim X t = 0 ) 

Theorem 4 (Metzler) If a is “Hicksian” as defined in chapter 
VI, with all its off diagonal elements positive, then it is necessarily 
Tinbergeman 

Theorem 5 (Metzler) If an gO and £ a,, < 1, then (a — /) is 

Hicksian, and hence (by Theorem 4) is Tinbergeman 

Theorem 6 (Mosak) If a is Hicksian, and if the off diagonal 
elements of a are all positive, then all of the elements of or 1 are 
egative 



DIFFERENCE EQUATIONS 439 


Theorem 7 (Oppenheim-Yntema) : If all the symbols are posi- 
tive in 



(pi + £ a ij )5 ii \ 

i 

& a 

bks 

(Wfc + £ h,) 8*. 

f 


then the pattern of signs in A~ l is 


+ 

... + 

- 

■ 

+ 

... + 

• 



: 

... _ 

+ ••• 

" + 

• 

• 

+ ••• 

+ 
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